Abstract: We consider a diffusion $(\xi_t)_{t\ge 0}$ with a $T$-periodic, time-dependent input term contained in the drift: under an unknown parameter $\vartheta \in \Theta$, some discontinuity - an additional periodic signal - occurs at times $kT+\vartheta$, $k \in \mathbb{N}$. Assuming positive Harris recurrence of $(\xi_{kT})_{k\in\mathbb{N}}$ and exploiting the periodicity structure, we prove limit theorems for certain martingales and functionals of the process $(\xi_t)_{t\ge 0}$. These allow us to consider the statistical model parametrized by $\vartheta \in \Theta$ locally in small neighbourhoods of some fixed $\vartheta$, with radius $\frac{1}{n}$ as $n \to \infty$. We prove convergence of local models to a limit experiment studied by Ibragimov and Khasminskii [IH 81] and discuss the behaviour of estimators under contiguous alternatives.
We consider a problem of parameter estimation in a Markov process $(\xi_t)_{t\ge 0}$ whose drift is $T$-periodic in the time variable. A parameter $\vartheta$ comes in through a periodic signal with periodicity $T$, and represents a time of discontinuity. Our main assumption on the process is positive Harris recurrence of $(\xi_{kT})_{k\in\mathbb{N}}$ from which - exploiting the periodicity structure of the semigroup - we prove positive Harris recurrence of the chain of $T$-segments $((\xi_{kT+s})_{0\le s\le T})_{k\in\mathbb{N}}$. This allows us to deduce limit theorems for certain martingales and strong laws of large numbers for certain functionals of the continuous-time process $(\xi_t)_{t\ge 0}$. Based on these, we deal with convergence of local models at $\vartheta$ - corresponding to observation of the process $(\xi_t)_{t\ge 0}$ up to time $nT$, local scale at $\vartheta$ will be $\frac{1}{n}$ as $n \to \infty$ - to a limit model whose likelihoods are of type $u \to e^{W_u - \frac12 |u|}$ with double-sided Brownian motion $W$. In this limit model, investigated by Ibragimov and Khasminskii [IH 81, sections VII.2-3], parametrization is $\frac12$-Hölder continuous in the sense of Hellinger distance. So we are far from the framework of local asymptotic (mixed) normality (well-studied since LeCam [L 68] and Hajek [H 70], see [D 85], [LY 90]), or from $L^2$-differentiable experiments. Ibragimov and Khasminskii considered a simple 'signal in white noise' setting, proved convergence of maximum likelihood and Bayes estimators, calculated the limit variance of the maximum likelihood estimator, and pointed out that in a limit model with likelihood ratios of type $e^{W_u - \frac12 |u|}$ the Bayes estimator is better than the maximum likelihood estimator.

Convergence to this limit model has been considered in several settings since then: see [KK 00] in a framework of delay equations, [K 04, section 3.4] in time homogeneous ergodic diffusions with spatial discontinuity in the drift, [DP 84, section 3] in the context of a change point in iid observations, [D 09] and [CK 09] with two-sided compensated Poisson or compound Poisson processes appearing in log-likelihood ratios. The approach of Ibragimov and Khasminskii starts from certain assumptions on Hellinger distances and from convergence of likelihood ratios 'uniformly in $\vartheta$'. In several aspects, our approach is different. We develop limit theorems in diffusions with $T$-periodic semigroup which will be our key tool in view of convergence of likelihood ratios and estimators. Whereas these allow us to check and exploit assumptions on Hellinger distances similarly to the work quoted above, convergence of likelihood ratios 'uniformly in $\vartheta$' is not suitable for our framework of inhomogeneity in time, and is systematically avoided. We focus on contiguous alternatives, make extensive use of 'LeCam's Third Lemma' (see [LY 90, pp. 22-23]), and exploit asymptotic equivariance of suitable estimator sequences with respect to contiguous alternatives. Our local asymptotic minimax bound, of the type of the asymptotic minimax bound for risk functions in Strasser ([S 85, Cor. 62.6], see also [L 72], [M 82], [V 91]), controls a maximal quadratic risk on shrinking neighbourhoods of $\vartheta$ with radius proportional to $\frac{1}{n}$, and a Bayes sequence attains this bound.

We describe our setting in more detail. The observed diffusion process is inhomogeneous in time

(1) $d\xi_t = [\, S(\vartheta,t) + b(\xi_t) \,]\, dt + \sigma(\xi_t)\, dW_t$ , $t \ge 0$

with some deterministic and $T$-periodic input $t \to S(\vartheta,t)$ where, for known periodicity $T$ and for known functions $\lambda(\cdot) > 0$ and $\lambda^*(\cdot) > 0$ which are continuous and $T$-periodic,

(2) $S(\vartheta,t) = \lambda(t) + \lambda^*(t)\, 1_{(\vartheta,\vartheta+a)}(i_T(t))$ , $t \ge 0$ , with $i_T(t) := t$ modulo $T$

depends on an unknown parameter $\vartheta \in \Theta$. This means that some additional input $\lambda^*$ is switched on periodically at times $kT+\vartheta$, $k \in \mathbb{N}_0$, and is of known duration $a$. We put $\Theta := (0, T-a)$.

The functions $b(\cdot)$ and $\sigma(\cdot)$ are Lipschitz; hence for all values of the parameter $\vartheta$ we have Lipschitz and linear growth conditions for the time-dependent coefficients of the above SDE, and thus existence and pathwise uniqueness for its solution.
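To make the setting (1)+(2) concrete, the following is a minimal simulation sketch using an Euler-Maruyama scheme. It is only an illustration under stated assumptions: the function names (`signal`, `euler_path`), the step size, and the particular choices of $\lambda$, $\lambda^*$, $b$ and $\sigma$ at the bottom are hypothetical and not taken from the paper.

```python
import math
import random


def signal(theta, t, T, a, lam, lam_star):
    """S(theta, t) = lam(t) + lam_star(t) * 1_{(theta, theta + a)}(t modulo T), as in (2)."""
    s = t % T  # i_T(t)
    extra = lam_star(t) if theta < s < theta + a else 0.0
    return lam(t) + extra


def euler_path(theta, n_periods, T, a, lam, lam_star, b, sigma,
               x0=0.0, steps_per_period=200, seed=0):
    """Euler-Maruyama approximation of the SDE (1) on [0, n_periods * T]."""
    rng = random.Random(seed)
    dt = T / steps_per_period
    path = [x0]
    for j in range(n_periods * steps_per_period):
        t = j * dt
        x = path[-1]
        drift = signal(theta, t, T, a, lam, lam_star) + b(x)
        noise = sigma(x) * math.sqrt(dt) * rng.gauss(0.0, 1.0)
        path.append(x + drift * dt + noise)
    return path


# Hypothetical coefficients: constant inputs, mean-reverting b, unit sigma.
lam = lambda t: 1.0
lam_star = lambda t: 2.0
b = lambda x: -x
sigma = lambda x: 1.0

path = euler_path(theta=0.3, n_periods=5, T=1.0, a=0.2,
                  lam=lam, lam_star=lam_star, b=b, sigma=sigma)
```

With these choices the additional input of size 2 is active on the windows $(k+0.3,\, k+0.5)$, $k = 0, \dots, 4$, and the path is sampled on a grid of 200 steps per period.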

We are interested in convergence of local models and convergence of maximum likelihood (MLE) and Bayes (BE) estimators for the unknown parameter $\vartheta \in \Theta$ when a trajectory of $\xi$ has been observed up to time $nT$. As $n \to \infty$, the right choice of local scale for local models at $\vartheta$ turns out to be $\frac{1}{n}$, at every point $\vartheta \in \Theta$. For the limit of local models at $\vartheta$, we find likelihood ratios

(3) $L^{u/0} := \exp\left\{ W(uJ_\vartheta) - \frac12 |uJ_\vartheta| \right\}$ , $u \in \mathbb{R}$

with double-sided Brownian motion $(W_u)_{u\in\mathbb{R}}$ and with scaling constants $0 < J_\vartheta < \infty$. From Terent'yev [T 68] over Golubev [G 79] and Ibragimov and Khasminskii [IH 81] to Rubin and Song [RS 95] it has become evident that in experiments of type (3) with unknown parameter $u \in \mathbb{R}$, the variance of the Bayes estimator with respect to quadratic loss is strictly smaller than the variance of the maximum likelihood estimator. Note that in the limit experiment (3) - not a 'quadratic' experiment, not $L^2$-differentiable - there is no sufficient statistic, no analogue of a central statistic, and no analogue of a convolution theorem ([H 70], [J 82]) which in the classical LAN/LAMN case - together with a lemma stating that 'arbitrary estimator sequences are in some sense almost equivariant' - is the key tool to obtain a local asymptotic minimax theorem (see [LY 90, p. 83]) simultaneously for a large class of loss functions. In the context here, we have no more than local asymptotic minimax bounds with respect to specified loss functions (see Strasser [S 85, Cor. 62.6], or [L 72], [M 82], [V 91]), and no tool for comparison between different bounds associated to different choices of a loss function.

Using contiguity techniques and exploiting an equivariance property (lemma 5.3 below) of the limit experiment (3), we consider squared loss and prove a bound (theorem 1.8 below)

(4) $\lim_{c\uparrow\infty}\ \liminf_{n\to\infty}\ \inf_{\tilde\vartheta_{nT}}\ \sup_{|u|\le c}\ E_{\vartheta+\frac{u}{n}}\left( \left[ n\left( \tilde\vartheta_{nT} - (\vartheta+\tfrac{u}{n}) \right) \right]^2 \right) \ \ge\ E_{P_0}\left( [u^*]^2 \right)$

where $u^*$ is the Bayes estimator associated to quadratic loss in the limit experiment (3), and where the infimum runs over all possible estimators $\tilde\vartheta_{nT}$ based on observation of $\xi$ up to time $nT$. We prove asymptotic equivariance with respect to contiguous alternatives of a Bayes sequence $\vartheta^*_{nT}$ with respect to squared loss (proposition 1.7.b) as $n \to \infty$: thus this Bayes sequence attains the bound (4).

In view of asymptotic statistical properties, our model behaves exactly as the simple 'signal in white noise' setting of [IH 81, section VII.2] which corresponds to the special case $\sigma(\cdot) \equiv 1$, $b(\cdot) \equiv 1$ in (1) above. However, if limit theorems are the key tool to prove statistical properties (convergence of experiments, convergence of estimators, ...), these are radically different in our case. In their likelihoods, thanks to $\sigma(\cdot) \equiv 1$, [IH 81] can work with very simple Gaussian processes where calculation of means and covariances is enough to determine the limiting behaviour. Similarly, in the time homogeneous ergodic diffusion model of [K 04, section 3.4] with one discontinuity in the drift, well known limit theorems for convergence of martingales and of additive functionals for ergodic diffusions are at hand. Our time inhomogeneous $T$-periodic problem (1) with non-trivial $\sigma(\cdot)$ requires a completely new approach. So an essential part of the present paper is devoted to proving limit theorems which make statistical theories work in our setting. Also in view of the behaviour of estimators under contiguous alternatives, we have to go beyond what had been done earlier in order to obtain the local asymptotic minimax bound (4).



Our interest in periodicity structures in diffusions is linked to the following application. In some membrane potential data sets similar to those investigated in [H 07] which we wish to interpret as realizations of certain SDEs (out of many references, we mention [LL 87], [T 89], [LS 99], [DL 05], [DL 06]), there is evidence for time-dependent 'input' in the drift which the modelling has to take into account. In analogy to the result of [BH 06, section 3.2] on large systems of neurons receiving identical time-dependent input, questions of periodic input received by a single neuron in an active network deserve to be studied. In particular, a discontinuity (2) with constants $\lambda$, $\lambda^*$ can be interpreted as some stimulus switched on/off periodically, and is of biological relevance.



This paper is organized as follows. Section 1 states all statistical results concerning the model defined by (1)+(2): convergence of experiments, convergence of estimators, local asymptotic minimax bound (theorem 1.8). Section 2 deals with Harris properties of the chain of $T$-segments; it is formulated in a more general setting and can be read independently. Section 3 contains an exponential inequality adapted to our purposes from Brandt [B 05]. Section 4 works out the limit theorems which we need to consider local models in our problem (1)+(2). The main results in this probabilistic part of the paper are theorems 2.1+4.1 (strong laws of large numbers for time inhomogeneous diffusions with periodicity structure) and theorem 4.3 with remark 4.4 (convergence of martingale terms which occur in the log-likelihood ratios of local models at $\vartheta$). On this basis, section 5 contains the statistical part of work to be done, and collects all proofs for the results stated in section 1.
1 Outline of statistical results



In order to exploit 'ergodicity properties' of the time inhomogeneous diffusion $\xi = (\xi_t)_{t\ge 0}$ in (1) with $T$-periodic time-dependent input (2), our principal assumption will be

(H1) the embedded chain $(\xi_{kT})_{k\in\mathbb{N}_0}$ is positive recurrent in the sense of Harris

for any fixed value of the parameter $\vartheta \in \Theta$. As an example, (H1) always holds if piecewise continuous $T$-periodic input is added to an Ornstein-Uhlenbeck SDE, see 2.3 below. Under (H1), there is a unique invariant probability $\mu^{(\vartheta)}$ for the chain $(\xi_{kT})_{k\in\mathbb{N}_0}$ under $\vartheta$. We introduce the chain of $T$-segments in the path of $\xi$

$X = (X_k)_k$ defined by $X_k := (\xi_{(k-1)T+s})_{0\le s\le T}$ , $k \ge 1$

which - as a consequence of the $T$-periodicity in the drift of our SDE - is time homogeneous. $X$ takes values in the path space $(C_T, \mathcal{C}_T)$ of continuous functions $[0,T] \to \mathbb{R}$. We deduce from assumption (H1) - see theorem 2.1 in section 2 below - that this $T$-segment chain is positive recurrent in the sense of Harris under $\vartheta$, with a specified invariant probability $m^{(\vartheta)}$ on $(C_T, \mathcal{C}_T)$. Limit theorems for functionals of the process $\xi$ which we need for our analysis of the statistical model (see theorems 4.1 and 4.3 in section 4 below) are then obtained through strong laws of large numbers in the Harris chain $X = (X_k)_{k\in\mathbb{N}_0}$.
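The construction of the $T$-segment chain can be sketched on a discretely sampled path. The snippet below is only an illustration: the helper name `t_segments` and the toy path are hypothetical, and a real path of $\xi$ is continuous rather than sampled. It shows that consecutive segments overlap in exactly one point, the value of the path at a multiple of $T$.

```python
def t_segments(path, steps_per_period):
    """Split samples xi_0, xi_dt, xi_2dt, ... into T-segments.

    Segment k (k >= 1) holds the samples of xi on [(k-1)T, kT], both
    endpoints included, where T = steps_per_period * dt.
    """
    n_periods = (len(path) - 1) // steps_per_period
    return [path[(k - 1) * steps_per_period: k * steps_per_period + 1]
            for k in range(1, n_periods + 1)]


# Toy path with 4 steps per period, observed over 3 periods.
toy_path = [0.1 * j for j in range(13)]
X = t_segments(toy_path, steps_per_period=4)
```

The terminal value of $X_k$ is the initial value of $X_{k+1}$, which is why the one-step transition of the segment chain only depends on the current segment through its endpoint.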

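Let $Q^\vartheta$ denote the law of the process $\xi = (\xi_t)_{t\ge 0}$ of (1)+(2) under $\vartheta \in \Theta$, a law on the canonical path space $(C, \mathcal{C})$ of continuous functions $[0,\infty) \to \mathbb{R}$ equipped with its canonical filtration $\mathbb{G}$. Our second major assumption

(H2) $\sigma(\cdot)$ is bounded away from $0$ and $\infty$ on $\mathbb{R}$

guarantees that for any pair of different values $\zeta' \ne \zeta$ in $\Theta$, the laws $Q^{\zeta'}$ and $Q^{\zeta}$ are locally equivalent with respect to $\mathbb{G}$, and the likelihood ratio process of $Q^{\zeta'}$ to $Q^{\zeta}$ is

$L_t^{\zeta'/\zeta} = \exp\left\{ \int_0^t \frac{S(\zeta',s)-S(\zeta,s)}{\sigma^2(\eta_s)}\, dM_s^{\zeta} - \frac12 \int_0^t \frac{\left(S(\zeta',s)-S(\zeta,s)\right)^2}{\sigma^2(\eta_s)}\, ds \right\}$

where we write $\eta = (\eta_t)_{t\ge 0}$ for the canonical process on $(C, \mathcal{C})$, and $M^{\zeta}$ for the $(Q^{\zeta}, \mathbb{G})$-martingale part of $\eta$ (see [JS 87], [LS 81], [Ku 04]). Let $B$ denote a version of $\int \frac{1}{\sigma(\eta_s)}\, dM_s^{\zeta}$ under the reference point $\zeta \in \Theta$: $B$ is a $\mathbb{G}$-Brownian motion under $Q^{\zeta}$. Since $S(\vartheta, \cdot)$ is - up to the continuous functions $\lambda(\cdot)$, $\lambda^*(\cdot)$, and up to $T$-periodic continuation - the indicator function $1_{(\vartheta,\vartheta+a)}$, the likelihood ratio $L_t^{\zeta'/\zeta}$ takes for $\zeta'$ sufficiently close to $\zeta$ the simple form

$\exp\Big\{ - \int_0^t \frac{\lambda^*(s)}{\sigma(\eta_s)}\, 1_{(\zeta,\zeta')}(i_T(s))\, dB_s - \frac12 \int_0^t \Big(\frac{\lambda^*(s)}{\sigma(\eta_s)}\Big)^2 1_{(\zeta,\zeta')}(i_T(s))\, ds$
$\qquad + \int_0^t \frac{\lambda^*(s)}{\sigma(\eta_s)}\, 1_{(\zeta+a,\zeta'+a)}(i_T(s))\, dB_s - \frac12 \int_0^t \Big(\frac{\lambda^*(s)}{\sigma(\eta_s)}\Big)^2 1_{(\zeta+a,\zeta'+a)}(i_T(s))\, ds \Big\}$

in case $\zeta < \zeta' < \zeta + a$; the same holds with $\zeta$, $\zeta'$ in the intervals and sign $\pm$ in front of the stochastic integrals interchanged if $\zeta' < \zeta < \zeta' + a$. Our first main result - very easy to see in the 'trivial' case where $\sigma(\cdot)$ is constant (cf. [IH 81], lemma 2.4 on p. 334), essentially more difficult for Lipschitz functions $\sigma(\cdot)$ satisfying (H1) where the argument has to go back to the chain of $T$-segments in the path of $\xi$ (cf. theorems 4.1, 4.3 and remark 4.4 in section 4 below) - is the following.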



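1.1 Theorem: Under Lipschitz and linear growth conditions on $b(\cdot)$ and $\sigma(\cdot)$, under (H1) and (H2), the following holds for every $\vartheta \in \Theta$:

a) we have convergence under $Q^\vartheta$ as $n \to \infty$ of

$\left( L_{nT}^{(\vartheta+\frac{u}{n})/\vartheta} \right)_{u \in \Theta_{\vartheta,n}}$ , $\Theta_{\vartheta,n} := \{ u \in \mathbb{R} : \vartheta + \frac{u}{n} \in \Theta \}$

in the sense of finite dimensional distributions to

(5) $L = \left( L^{u/0} \right)_{u\in\mathbb{R}}$ , $L^{u/0} := \exp\left\{ W(uJ_\vartheta) - \frac12 |uJ_\vartheta| \right\}$

where $(W_u)_{u\in\mathbb{R}}$ is two-sided standard Brownian motion, and $J_\vartheta$ the scaling constant

$J_\vartheta := \left\{ (\lambda^*(\vartheta))^2\, \mu^{(\vartheta)} P^{(\vartheta)}_{0,\vartheta} + (\lambda^*(\vartheta+a))^2\, \mu^{(\vartheta)} P^{(\vartheta)}_{0,\vartheta+a} \right\}\left( \frac{1}{\sigma^2} \right)$.

Here $(P^{(\vartheta)}_{s,t})_{0\le s<t<\infty}$ denotes the semigroup of the process $(\xi_t)_{t\ge 0}$ under $\vartheta$, $\mu^{(\vartheta)}$ the invariant measure for $(\xi_{kT})_{k\in\mathbb{N}_0}$ according to (H1), and we write for short $\mu(f)$ for $\int f\, d\mu$.

b) Let $W$ in (5) be defined on some $(\Omega, \mathcal{A}, P_0)$. Then $(\Omega, \mathcal{A})$ carries a limit experiment

(6) $\mathcal{E} = \{ P_u : u \in \mathbb{R} \}$ defined by $dP_u := L^{u/0}\, dP_0$ on $(\Omega, \mathcal{A})$

such that we have convergence of experiments: local experiments at $\vartheta$

$\mathcal{E}_n^{(\vartheta)} := \left\{ Q^{\vartheta+\frac{u}{n}} \,\big|\, \mathcal{G}_{nT} : u \in \Theta_{\vartheta,n} \right\}$ , $n \ge 1$

converge as $n \to \infty$ to the limit experiment $\mathcal{E}$ in the sense of Strasser ([S 85], p. 302).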
We will prove this theorem, based on the results of sections 2 to 4, in 5.2 below.



1.2 Remark: We give an interpretation for the type of limit experiment in (6), putting $J_\vartheta = 1$ for short. Recall that two-sided standard Brownian motion $(W_u)_{u\in\mathbb{R}}$ on $(\Omega, \mathcal{A}, P_0)$ means that two independent standard Brownian motions $(W^+_v)_{v\ge 0}$ and $(W^-_v)_{v\ge 0}$ exist on $(\Omega, \mathcal{A}, P_0)$ such that $W_u$ is given by $W^+_u$ if $u \ge 0$, and by $W^-_{|u|}$ if $u < 0$. Define

(7) $P_u := \mathcal{L}\left( (W^+_v + u \wedge v)_{v\ge 0} ,\ (W^-_v)_{v\ge 0} \mid P_0 \right)$ in case $u \ge 0$ ,
$\quad\ \ P_u := \mathcal{L}\left( (W^+_v)_{v\ge 0} ,\ (W^-_v + |u| \wedge v)_{v\ge 0} \mid P_0 \right)$ in case $u < 0$ .

Here constant drift 1 is added to $W^+$ in case $u > 0$ and to $W^-$ in case $u < 0$, and is switched off at time $|u|$. Consider the likelihood ratio process of $P_u$ to $P_0$ relative to the filtration generated by the bivariate canonical process on $C([0,\infty), \mathbb{R}^2)$ (see proof of lemma 5.3 below, or use [JS 87], [LS 81]). If we are allowed to observe over the infinite time interval $[0,\infty)$, we end up with the likelihood ratios

$L^{u/0} = \exp\left\{ W(u) - \frac12 |u| \right\}$

given in (3). Up to the scaling factor $J_\vartheta$, this is the situation of theorem 1.1.



1.3 Remark: Instead of fixed duration $a$ of the additional signal $\lambda^*$ as in (2), we might work with a two-dimensional parameter $(\vartheta_1, \vartheta_2)$ such that $0 < \vartheta_1 < \vartheta_2 < T$, and with

$S(\vartheta,t) = \lambda(t) + \lambda^*(t)\, 1_{(\vartheta_1,\vartheta_2)}(i_T(t))$ , $t \ge 0$.

All results which we give here generalize to this two-dimensional problem: the limit experiment will be a product of two models as in 1.2, with scaling constant $(\lambda^*(\vartheta_i))^2 \left( \mu^{(\vartheta_1,\vartheta_2)} P^{(\vartheta_1,\vartheta_2)}_{0,\vartheta_i} \right)\left( \frac{1}{\sigma^2} \right)$ in factor $i$.



Based on theorem 1.1, we proceed using theorems 19-21 in appendix A 1.4 of Ibragimov and Khasminskii [IH 81] (up to the 'uniform in $\vartheta$' part which can be omitted there both in the assertion and in the assumption of the theorem) and techniques developed in [IH 81, chapter I, theorems 5.1+5.2] to show



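1.4 Theorem: Under Lipschitz and linear growth conditions on $b(\cdot)$ and $\sigma(\cdot)$, under (H1) and (H2), we have the following for every $\vartheta \in \Theta$:

a) For $K < \infty$ arbitrarily large, the likelihood ratios in local models $\mathcal{E}_n^{(\vartheta)}$ at $\vartheta$

$\left( L_{nT}^{(\vartheta+\frac{u}{n})/\vartheta} \right)_{u\in[-K,K]}$ under $Q^\vartheta$

converge weakly in $C([-K,K])$ as $n \to \infty$ to the likelihoods

$\left( L^{u/0} \right)_{u\in[-K,K]}$ under $P_0$

in the limit model $\mathcal{E}$ of (5)+(6).

b) For arbitrary $p \in \mathbb{N}_0$ and $K_0 > 0$, there are constants $b_1(p,K_0)$ and $b_2$ such that

(8) $Q^\vartheta\left( \sup_{u\in\Theta_{\vartheta,n},\ |u|>K} |u|^p\, L_{nT}^{(\vartheta+\frac{u}{n})/\vartheta} > 1 \right) \le b_1(p,K_0)\, e^{-b_2 K}$ , if $K_0 \le K < \infty$

holds for all $\vartheta \in \Theta$ and all $n \ge 1$, together with

(9) $P_0\left( \sup_{|u|>K} |u|^p\, L^{u/0} > 1 \right) \le b_1(p,K_0)\, e^{-b_2 K}$ , $K_0 \le K < \infty$ .

Here $b_1(p,K_0)$ does not depend on $\vartheta$ or $n$, and $b_2$ does not depend on $\vartheta$, $n$, $p$, $K_0$.

c) For arbitrary $p \in \mathbb{N}_0$ and $K_0 > 0$, there are constants $b_1(p,K_0)$ and $b_2$ such that

(10) $E_\vartheta\left( \int_{\Theta_{\vartheta,n}\cap\{|u|>K\}} |u|^p\, \frac{L_{nT}^{(\vartheta+\frac{u}{n})/\vartheta}}{\int_{\Theta_{\vartheta,n}} L_{nT}^{(\vartheta+\frac{u'}{n})/\vartheta}\, du'}\, du \right) \le b_1(p,K_0)\, e^{-b_2 K}$ , $K_0 \le K < \infty$

holds for all $\vartheta \in \Theta$ and all $n \ge 1$, together with

(11) $E_{P_0}\left( \int_{\{|u|>K\}} |u|^p\, \frac{L^{u/0}}{\int_{\mathbb{R}} L^{u'/0}\, du'}\, du \right) \le b_1(p,K_0)\, e^{-b_2 K}$ , $K_0 \le K < \infty$ .

Again $b_1(p,K_0)$ does not depend on $\vartheta$ or $n$, and $b_2$ does not depend on $\vartheta$, $n$, $p$, $K_0$.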



The proof of theorem 1.4 will be given in 5.5 below. As consequences of theorem 1.4, we will obtain convergence of maximum likelihood (MLE) and Bayes estimators (BE) for the unknown parameter $\vartheta \in \Theta$ when a trajectory of $\xi$ is observed up to time $nT$, $n \to \infty$. The MLE sequence is

(12) $\hat\vartheta_{nT} = \operatorname{argmax}_{\zeta\in\bar\Theta} L_{nT}^{\zeta/\zeta_0} := \min\left\{ \zeta \in \bar\Theta : L_{nT}^{\zeta/\zeta_0} = \max_{\zeta'\in\bar\Theta} L_{nT}^{\zeta'/\zeta_0} \right\}$ , $n \in \mathbb{N}$

with $\zeta_0 \in \Theta$ some fixed point, and $\bar\Theta$ the closure of $\Theta$. The presence of 'min' in (12) guarantees a measurable selection whenever the argmax is not unique. The MLE in the limit experiment $\mathcal{E}$

(13) $\hat u = \operatorname{argmax}_{u\in\mathbb{R}} L^{u/0}$

is finite-valued and uniquely determined almost surely, by [IH 81, chapter VII, lemma 2.5]. The control (9) guarantees in particular that a BE 'with uniform prior on the entire real line'

(14) $u^* := \dfrac{\int_{-\infty}^{\infty} u\, L^{u/0}\, du}{\int_{-\infty}^{\infty} L^{u/0}\, du}$

(sometimes called Pitman estimator) is well defined in the limit experiment. Correspondingly, we consider the BE sequence with uniform prior on $\Theta = (0, T-a)$

(15) $\vartheta^*_{nT} := \dfrac{\int_\Theta \zeta\, L_{nT}^{\zeta/\zeta_0}\, d\zeta}{\int_\Theta L_{nT}^{\zeta/\zeta_0}\, d\zeta}$

for the unknown parameter $\vartheta \in \Theta$, based on observation of $\xi$ up to time $nT$ (in (15), we might as well use smooth and strictly positive prior densities on $\Theta$, but this generalization turns out to be without interest in view of theorems 1.7 and 1.8 below).
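The two estimators (13) and (14) in the limit experiment can be approximated numerically from one simulated path of two-sided Brownian motion. The following is a rough sketch under explicit assumptions: we put $J_\vartheta = 1$, truncate the real line to a window $[-K, K]$ (the tail bounds of theorem 1.4 suggest this loses little for large $K$), and discretize with step $h$. The function names and the values of $K$, $h$ and the seed are hypothetical choices for illustration.

```python
import math
import random


def two_sided_bm(K, h, rng):
    """Grid u_j = j*h, j = -m..m, and values W(u_j) of a two-sided BM with W(0) = 0."""
    m = int(round(K / h))
    right, left = [0.0], [0.0]          # W^+ and W^- on the grid 0, h, ..., m*h
    for _ in range(m):
        right.append(right[-1] + math.sqrt(h) * rng.gauss(0.0, 1.0))
        left.append(left[-1] + math.sqrt(h) * rng.gauss(0.0, 1.0))
    grid = [j * h for j in range(-m, m + 1)]
    W = left[:0:-1] + right             # W(-m*h), ..., W(-h), W(0), ..., W(m*h)
    return grid, W


def limit_estimators(grid, W):
    """Grid approximations of the MLE (13) and the Pitman estimator (14), with J = 1."""
    log_lik = [w - 0.5 * abs(u) for u, w in zip(grid, W)]
    j_max = max(range(len(grid)), key=lambda j: log_lik[j])
    u_hat = grid[j_max]
    c = log_lik[j_max]                  # subtract the maximum for numerical stability
    weights = [math.exp(v - c) for v in log_lik]
    u_star = sum(u * w for u, w in zip(grid, weights)) / sum(weights)
    return u_hat, u_star


grid, W = two_sided_bm(K=20.0, h=0.01, rng=random.Random(1))
u_hat, u_star = limit_estimators(grid, W)
```

Repeating this over many independent paths and comparing the empirical variances of `u_hat` and `u_star` gives a Monte Carlo impression of the gap between MLE and BE discussed in remark 1.6 below; the discretization and truncation make this an approximation only.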



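1.5 Theorem: Under Lipschitz and linear growth conditions on $b(\cdot)$ and $\sigma(\cdot)$, under (H1) and (H2), we have the following properties of the MLE and the BE sequence, for every $\vartheta \in \Theta$:

a) weak convergence as $n \to \infty$:

$\mathcal{L}\left( n(\hat\vartheta_{nT} - \vartheta) \mid Q^\vartheta \right) \to \mathcal{L}\left( \hat u \mid P_0 \right)$ , $\mathcal{L}\left( n(\vartheta^*_{nT} - \vartheta) \mid Q^\vartheta \right) \to \mathcal{L}\left( u^* \mid P_0 \right)$ ;

b) finite moments of arbitrary order $p \in \mathbb{N}$, and convergence of moments as $n \to \infty$:

$E_\vartheta\left( \left[ n(\hat\vartheta_{nT} - \vartheta) \right]^p \right) \to E_{P_0}\left( \hat u^{\,p} \right)$ , $E_\vartheta\left( \left[ n(\vartheta^*_{nT} - \vartheta) \right]^p \right) \to E_{P_0}\left( [u^*]^p \right)$ .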



Theorem 1.5 will be proved in 5.6 and 5.7 below. Note that part a) of the theorem would be enough 
for convergence of risks with respect to loss functions which are continuous, subconvex and bounded, 
i.e. 'bowl-shaped' in the sense of LeCam and Yang [LY 90, p. 82], whereas part b) is useful e.g. for 
quadratic loss to be considered below. 



1.6 Remark: For the limit experiment in the setting of remark 1.2 where $J_\vartheta$ equals 1, the following is known. Ibragimov and Khasminskii [IH 81, p. 342] calculated the variance of the MLE, whereas Rubin and Song [RS 95] calculated the variance of the BE: the ratio of MLE to BE variance is 26 to $16 \cdot \zeta(3) \approx 19.2$ where $\zeta(s) = \sum_{n=1}^{\infty} n^{-s}$ is Riemann's zeta function. □
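The numerical value of $16 \cdot \zeta(3)$ quoted in remark 1.6 is easy to check with a partial sum of the series. The sketch below is only an arithmetic illustration; the helper name `zeta3` and the number of terms are arbitrary choices, and the truncation error of the partial sum is of order $1/(2N^2)$ for $N$ terms.

```python
def zeta3(terms=200000):
    """Partial sum of zeta(3) = sum_{n >= 1} 1 / n**3."""
    return sum(1.0 / n ** 3 for n in range(1, terms + 1))


mle_variance = 26.0                 # limit variance of the MLE for J = 1
be_variance = 16.0 * zeta3()        # limit variance of the BE for J = 1
ratio = mle_variance / be_variance  # how much larger the MLE variance is
```

The partial sum gives $16 \cdot \zeta(3) \approx 19.23$, so the MLE variance exceeds the BE variance by a factor of about 1.35.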
In our setting with scaling factor $J_\vartheta$ in the limit experiment (5)+(6), the results quoted in remark 1.6 yield the following: the limit law for rescaled MLE errors has finite variance

(16) $J_\vartheta^{-2} \cdot 26$

at $\vartheta$, different from the limit law for rescaled BE errors whose variance is

(17) $J_\vartheta^{-2} \cdot 16 \cdot \zeta(3) \approx J_\vartheta^{-2} \cdot 19.2$ .

Moreover, by definition of a Bayes estimator with respect to quadratic loss - this is the $L^2$-projection property of conditional expectations - we have at every finite stage $n$ optimality of BE in the sense of integrated risk under quadratic loss functions: comparing all possible estimators $\tilde\vartheta_{nT}$ based on observation of the trajectory of $\xi$ up to time $nT$, a minimum

(18) $\min\left\{ \int_\Theta E_\vartheta\left( \left[ n(\tilde\vartheta_{nT} - \vartheta) \right]^2 \right) d\vartheta \ :\ \tilde\vartheta_{nT}$ is $\mathcal{G}_{nT}$-measurable $\right\}$

exists and is realized by the integrated risk of the BE (15) with respect to quadratic loss

(19) $\int_\Theta E_\vartheta\left( \left[ n(\vartheta^*_{nT} - \vartheta) \right]^2 \right) d\vartheta$ ,

for every fixed $n$. This is an elementary and pre-asymptotic argument, averaging over the whole parameter space $\Theta$, and does not contain much information on the behaviour of our estimator in small neighbourhoods of a parameter value $\vartheta$. The following two results deal with shrinking neighbourhoods of fixed points in the parameter space, in the sense of contiguous alternatives, and fill this gap.



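1.7 Theorem: Under Lipschitz and linear growth conditions on $b(\cdot)$ and $\sigma(\cdot)$, under (H1) and (H2), the following holds for every $\vartheta \in \Theta$.

a) The BE $u^*$ in the limit experiment $\mathcal{E}$ with 'uniform prior over $\mathbb{R}$' (14) is equivariant:

for every $u \in \mathbb{R}$ : $\mathcal{L}\left( u^* - u \mid P_u \right) = \mathcal{L}\left( u^* \mid P_0 \right)$ .

b) The sequence of BE at stage $n$ defined by (15) is asymptotically as $n \to \infty$ equivariant in the local models $\mathcal{E}_n^{(\vartheta)}$ at $\vartheta$ in the following sense:

$\lim_{n\to\infty}\ \sup_{|u|\le C}\ \left| E_{\vartheta+\frac{u}{n}}\left( \ell\left( n\left( \vartheta^*_{nT} - (\vartheta+\tfrac{u}{n}) \right) \right) \right) - E_{P_0}\left( \ell(u^*) \right) \right| = 0$

for every continuous and subconvex loss function $\ell$ which admits a polynomial majorant, and for arbitrary choice of $C < \infty$.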
An analogous statement also holds for maximum likelihood estimators; we will prove theorem 1.7 in 5.8 below. Now, for quadratic loss, we consider maximal risks over small neighbourhoods of arbitrary points $\vartheta \in \Theta$, in the sense of contiguous alternatives, and state a local asymptotic minimax theorem (for a particular loss function) which allows us to compare arbitrary sequences of $\mathcal{G}_{nT}$-measurable estimators $\tilde\vartheta_{nT}$, $n \ge 1$, for the unknown parameter $\vartheta \in \Theta$.

1.8 Theorem: Under Lipschitz and linear growth conditions on $b(\cdot)$ and $\sigma(\cdot)$, under (H1) and (H2), the following holds for every $\vartheta \in \Theta$.

a) For squared loss, there is a local asymptotic minimax bound in terms of the BE $u^*$ of (14)

$\lim_{c\uparrow\infty}\ \liminf_{n\to\infty}\ \inf_{\tilde\vartheta_{nT}}\ \sup_{|u|\le c}\ E_{\vartheta+\frac{u}{n}}\left( \left[ n\left( \tilde\vartheta_{nT} - (\vartheta+\tfrac{u}{n}) \right) \right]^2 \right) \ \ge\ E_{P_0}\left( [u^*]^2 \right)$

where at every stage $n \ge 1$, inf is with respect to all possible $\mathcal{G}_{nT}$-measurable estimators.

b) The BE sequence $(\vartheta^*_{nT})_n$ of (15) attains the bound given in a), in virtue of 1.7 b).

Part a) of this theorem is a result of type [S 85, Cor. 62.6], see also [L 72], [M 82] or [V 91]. The proof will be given in 5.9 below.



2 Ergodic properties for diffusions with T-periodic drift

In this section, we discuss ergodic properties for processes $(\xi_t)_{t\ge 0}$ of type (1)+(2) under fixed value of the parameter $\vartheta$. The special features (1)+(2) of section 1 are of no importance here. All we assume in the present section is that $(\xi_t)_{t\ge 0}$ is a Markov process with continuous paths, inhomogeneous in time, with semigroup $(P_{s,t})_{0\le s<t<\infty}$ having measurable densities

(20) $P_{s,t}(x, dy) = p_{s,t}(x,y)\, dy$ , $0 \le s < t < \infty$ , $x, y \in \mathbb{R}$

with respect to Lebesgue measure, such that the semigroup is $T$-periodic:

(21) $p_{s,t}(x,y) = p_{kT+s,kT+t}(x,y)$ for all $k \in \mathbb{N}_0$ and all $0 \le s < t < \infty$ .

We use the same notation $i_T(t) = t$ modulo $T$ as in (2). Obviously, this setting covers (1)+(2) of section 1. The results of section 2 will remain unchanged for Polish state space $(E, \mathcal{B}(E))$ - with some reference measure replacing Lebesgue measure in (20) - instead of $(\mathbb{R}, \mathcal{B}(\mathbb{R}))$.
We need strong laws of large numbers as $t \to \infty$ for some class of functionals of the process $\xi = (\xi_t)_{t\ge 0}$. This class will be defined in (27) below; relevant examples are of the following type. Let $\Lambda_T(ds)$ denote some $\sigma$-finite measure on $(-\infty,\infty)$ which is $T$-periodic in the sense that

(22) $\Lambda_T(B) = \Lambda_T(B + kT)$ for all $B \in \mathcal{B}((-\infty,\infty))$ and all $k \in \mathbb{Z}$ .

For $T$-periodic measures $\Lambda_T(ds)$ and for suitable functions $f : \mathbb{R} \to \mathbb{R}$, define

(23) $A = (A_t)_{t\ge 0}$ , $A_t = \int_0^t f(\xi_s)\, \Lambda_T(ds)$ , $t \ge 0$ .

With fixed $0 < r < r' < T$, in view of the log-likelihoods in theorem 1.1, we may thus consider

(24) $A_t = \sum_{k\in\mathbb{N}_0,\ kT+r\le t} f(\xi_{kT+r})$ , $\Lambda_T(ds) = \sum_{k\in\mathbb{N}_0} \epsilon_{(kT+r)}(ds)$

(with $\epsilon_x$ the Dirac measure at the point $x$) or

(25) $A_t = \int_0^t f(\xi_s)\, 1_{(r,r')}(i_T(s))\, ds$ , $\Lambda_T(ds) = \sum_{k\in\mathbb{N}_0} 1_{(kT+r,kT+r')}(s)\, ds$

which both obviously are not additive functionals of the continuous-time process $(\xi_t)_{t\ge 0}$.
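For a path sampled on a regular grid, the functionals (24) and (25) can be evaluated as follows. This is a sketch under assumptions: the period is $T = m \cdot dt$ with an integer number $m$ of grid steps, the times $r$ and $r'$ are grid points with indices `r_idx` and `rp_idx`, and the integral in (25) is replaced by a left-point Riemann sum. All function and argument names are hypothetical.

```python
def functional_24(path, m, r_idx, f, n_steps):
    """Sum of f(xi_{kT+r}) over k >= 0 with kT + r <= t, where t = n_steps * dt."""
    return sum(f(path[j]) for j in range(r_idx, n_steps + 1, m))


def functional_25(path, m, r_idx, rp_idx, f, n_steps, dt):
    """Left-point Riemann sum for int_0^t f(xi_s) 1_{(r, r')}(s modulo T) ds."""
    return sum(f(path[j]) * dt
               for j in range(n_steps)
               if r_idx <= j % m < rp_idx)


# Toy check with a constant path and f = 1: T = 10 steps, r = 2 steps, r' = 5 steps.
const_path = [0.0] * 31
one = lambda x: 1.0
count_3T = functional_24(const_path, 10, 2, one, 30)
window_3T = functional_25(const_path, 10, 2, 5, one, 30, 0.1)
```

With $f \equiv 1$, (24) counts the sampling times $kT+r$ up to $t$ and (25) measures the total time spent in the windows $(kT+r, kT+r')$; both grow by the same amount over each full period, which is the additivity along the chain of $T$-segments that (27) below requires.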

We fix some notations. We take $\xi$ as defined on some canonical path space $(\Omega, \mathcal{A}, \mathbb{F}, (P_x)_{x\in\mathbb{R}})$. 'Almost surely' means almost surely with respect to every $P_x$, $x \in \mathbb{R}$. An $\mathbb{F}^\xi$-increasing process is an $\mathbb{F}^\xi$-adapted càdlàg process $A = (A_t)_{t\ge 0}$ with nondecreasing paths and $A_0 = 0$, almost surely. We write $(C_T, \mathcal{C}_T)$ for the space of all continuous functions $\alpha : [0,T] \to \mathbb{R}$ equipped with the metric of uniform convergence and its Borel $\sigma$-field $\mathcal{C}_T$. Then $(C_T, \mathcal{C}_T)$ is a Polish space, and $\mathcal{C}_T = \sigma(\pi_t : 0 \le t \le T)$, the $\sigma$-field generated by the coordinate projections $\pi_t$, $0 \le t \le T$.

The continuous-time Markov process $\xi = (\xi_t)_{t\ge 0}$ induces a $(C_T, \mathcal{C}_T)$-valued Markov chain

$X = (X_k)_{k\in\mathbb{N}_0}$ , $X_k := (\xi_{(k-1)T+s})_{0\le s\le T}$ , $k \ge 1$ , $X_0 = \alpha_0 \in C_T$

which we call the chain of $T$-segments in the path of $\xi$. In virtue of (21), the chain $X = (X_k)_{k\in\mathbb{N}_0}$ is time homogeneous with one-step-transition kernel $Q(\cdot,\cdot)$ given by

$Q(\alpha, F) := P\left( (\xi_s)_{0\le s\le T} \in F \mid \xi_0 = \alpha(T) \right)$ , $\alpha \in C_T$ , $F \in \mathcal{C}_T$ .

For Harris processes in discrete time, we refer to Revuz [R 75] or Meyn and Tweedie [MT 93]. For Harris processes in continuous time see Azema, Duflo and Revuz [ADR 69] or Revuz and Yor [RY 91, Ch. 10.3]. Now 'ergodicity' of the process $(\xi_t)_{t\ge 0}$ with $T$-periodic semigroup will be understood as ergodicity of the segment chain $X = (X_k)_{k\in\mathbb{N}_0}$.



2.1 Theorem: Assume that the embedded chain $(\xi_{kT})_{k\in\mathbb{N}_0}$ is positive recurrent in the sense of Harris, and write $\mu$ for its invariant probability on $(\mathbb{R}, \mathcal{B}(\mathbb{R}))$.

a) Then the chain $X = (X_k)_{k\in\mathbb{N}_0}$ of $T$-segments in the path of $\xi$ is positive recurrent in the sense of Harris. Its invariant probability is the unique law $m$ on $(C_T, \mathcal{C}_T)$ such that

(26) for arbitrary $0 = t_0 < t_1 < \ldots < t_\ell < t_{\ell+1} = T$ and $A_i \in \mathcal{B}(\mathbb{R})$ ,
$m\left( \{ \pi_{t_i} \in A_i ,\ 0 \le i \le \ell+1 \} \right)$ is given by
$\int \cdots \int \mu(dx_0)\, 1_{A_0}(x_0) \prod_{i=0}^{\ell} P_{t_i,t_{i+1}}(x_i, dx_{i+1})\, 1_{A_{i+1}}(x_{i+1})$ .

b) For every $\mathbb{F}^\xi$-increasing process $A = (A_t)_{t\ge 0}$ with the property

(27) there is some function $F : C_T \to \mathbb{R}$, nonnegative, $\mathcal{C}_T$-measurable, satisfying $m(F) := \int_{C_T} F\, dm < \infty$ , such that
$A_{kT} = \sum_{j=1}^{k} F(X_j) = \sum_{j=1}^{k} F\left( (\xi_{(j-1)T+s})_{0\le s\le T} \right)$ , $k \ge 1$

we have the strong law of large numbers

$\lim_{t\to\infty} \frac{1}{t} A_t = \frac{1}{T}\, m(F)$ almost surely .



Proof: 1) Harris recurrence of the process $(\xi_{kT})_{k\in\mathbb{N}_0}$ with invariant probability $\mu$ yields

$A \in \mathcal{B}(\mathbb{R})$ , $\mu(A) > 0$ implies $\sum_k 1_A(\xi_{kT}) = \infty$ almost surely ,

and thus implies that the bivariate chain

$\left( \xi_{kT}, \xi_{(k+1)T} \right)_{k\in\mathbb{N}_0}$

is positive recurrent in the sense of Harris with invariant probability

$\mu^{(2)}(dx, dy) := \mu(dx)\, P_{0,T}(x, dy)$ on $\mathbb{R} \times \mathbb{R}$ .

2) Write $m$ for the unique law on $(C_T, \mathcal{C}_T)$ whose finite dimensional distributions are given by (26). Since $(C_T, \mathcal{C}_T)$ is Polish, conditioning with respect to the pair of coordinate projections $(\pi_0, \pi_T)$, the probability $m$ allows for a decomposition

(28) $m(F) = \int_{\mathbb{R}\times\mathbb{R}} \mu^{(2)}(dx, dy)\, K((x,y), F)$

with $K(\cdot,\cdot)$ some transition probability from $\mathbb{R} \times \mathbb{R}$ to $C_T$. Comparing with (26), $K((x,y), \cdot)$ is the law of the $\xi$-bridge from $x$ at time $0$ to $y$ at time $T$

(29) $K((x,y), F) := P\left( (\xi_s)_{0\le s\le T} \in F \mid \xi_0 = x ,\ \xi_T = y \right)$ ;

for $\mu^{(2)}$-almost all $(x,y)$, with notations of (26), $K\left( (x_0, x_{\ell+1}), \{ \pi_{t_i} \in A_i ,\ 0 \le i \le \ell+1 \} \right)$ equals

$\frac{1_{A_0}(x_0)}{p_{0,T}(x_0, x_{\ell+1})} \int \cdots \int dx_1 \ldots dx_\ell \prod_{i=0}^{\ell} p_{t_i,t_{i+1}}(x_i, x_{i+1})\, 1_{A_{i+1}}(x_{i+1})$

whenever $(x_0, x_{\ell+1})$ is in $\{ p_{0,T}(\cdot,\cdot) > 0 \}$, with suitable default definition else.

3) From (28) we have for sets $F \in \mathcal{C}_T$

(30) $m(F) > 0 \ \Longrightarrow\ $ there is some $\varepsilon > 0$ such that $\{ (x,y) : K((x,y), F) > \varepsilon \} =: B(F,\varepsilon)$ has strictly positive measure under $\mu^{(2)}(dx, dy)$ .

Hence the Harris property of the bivariate chain $(\xi_{kT}, \xi_{(k+1)T})_{k\in\mathbb{N}_0}$ with invariant measure $\mu^{(2)}(dx, dy)$ gives in combination with (30) first

$F \in \mathcal{C}_T$ , $m(F) > 0 \ \Longrightarrow\ \sum_k 1_{B(F,\varepsilon)}(\xi_{kT}, \xi_{(k+1)T}) = \infty$ almost surely

and then thanks to (28)

$F \in \mathcal{C}_T$ , $m(F) > 0 \ \Longrightarrow\ \sum_k 1_F(X_k) = \infty$ almost surely .

We have identified some probability measure $m$ on $(C_T, \mathcal{C}_T)$ such that sets of positive $m$-measure are visited infinitely often by $X = (X_n)_n$: hence $X$ is Harris. Every Harris chain admits a unique invariant measure. Periodicity (21) of the semigroup guarantees that $m$ defined in (26) is invariant for $X$. We have proved that $X = (X_n)_n$ is positive recurrent in the sense of Harris with invariant measure $m$ given by (26): this is part a) of the theorem.

5) Consider an $\mathbb{F}^\xi$-increasing process $A = (A_t)_{t\ge 0}$ related to a function $F : C_T \to [0,\infty)$ as in (27). Then $\Psi = (\Psi_k)_{k\in\mathbb{N}_0}$ defined by

$\Psi_k := A_{kT}$ , $k \in \mathbb{N}_0$

is an integrable additive functional of the chain $X = (X_k)_k$ of $T$-segments in $\xi$. Since $X$ is Harris with invariant measure $m$, we have the ratio limit theorem

(31) $\lim_{k\to\infty} \frac{1}{k} \Psi_k = E_m(\Psi_1)$ almost surely .

But $E_m(\Psi_1) = E_m(F(X_1)) = m(F)$, and

$\lim_{k\to\infty} \frac{1}{k} \Psi_k = T \lim_{t\to\infty} \frac{1}{t} A_t$

since $A$ is increasing. This proves part b) of the theorem. □



2.2 Examples: Under all assumptions of theorem 2.1, we deduce in particular the following laws of large numbers for the functionals (23)-(25). For $0 < r < r' < T$ fixed,

a) if $f : \mathbb{R} \to \mathbb{R}$ is measurable and in $L^1(\mu P_{0,r})$, then for $A = (A_t)_t$ considered in (24)

$\lim_{t\to\infty} \frac{1}{t} \sum_{k\in\mathbb{N}_0,\ kT+r\le t} f(\xi_{kT+r}) = \frac{1}{T}\, (\mu P_{0,r})(f)$ almost surely ;

b) if $f : \mathbb{R} \to \mathbb{R}$ is measurable and in $L^1\left( \int_r^{r'} ds\, (\mu P_{0,s}) \right)$, then for the functional in (25)

$\lim_{t\to\infty} \frac{1}{t} \int_0^t f(\xi_s)\, 1_{(r,r')}(i_T(s))\, ds = \frac{1}{T} \int_r^{r'} (\mu P_{0,s})(f)\, ds$ almost surely ;

c) if $f : \mathbb{R} \to \mathbb{R}$ is measurable and in $L^1\left( \int_0^T \Lambda_T(ds)\, (\mu P_{0,s}) \right)$ for some $T$-periodic measure $\Lambda_T$ as defined in (22), then we have for the functional $A = (A_t)_t$ in (23)

$\lim_{t\to\infty} \frac{1}{t} \int_0^t f(\xi_s)\, \Lambda_T(ds) = \frac{1}{T} \int_0^T \Lambda_T(ds)\, (\mu P_{0,s})(f)$ almost surely .

This follows from theorem 2.1: put $F(\alpha) = f(\alpha(r))$ for part a), $F(\alpha) = \int_r^{r'} f(\alpha(s))\, ds$ for part b), and $F(\alpha) = \int_0^T f(\alpha(s))\, \Lambda_T(ds)$ for part c), with $\alpha \in C_T$. □



In order to calculate explicitly the measures occurring in theorem 2.1 or example 2.2, we have to know the semigroup $(P_{s,t})_{0\le s<t<\infty}$ of the time inhomogeneous diffusion $(\xi_t)_{t\ge 0}$. Only in very few cases can this be done explicitly; we mention an Ornstein-Uhlenbeck type example.



2.3 Example: With $\sigma > 0$, $\gamma > 0$, and some function $S(\cdot)$ which is $T$-periodic and piecewise
continuous, consider an Ornstein-Uhlenbeck type diffusion with $T$-periodic drift

$$ d\xi_t = (S(t) - \gamma\,\xi_t)\,dt + \sigma\,dW_t , \quad t \ge 0 . $$

The solution with initial value $\xi_0$ is

$$ \xi_t = \xi_0\, e^{-\gamma t} + \int_0^t e^{-\gamma (t-s)} \big( S(s)\,ds + \sigma\,dW_s \big) $$

and the transition semigroup $(P_{s,t})_{0 \le s < t < \infty}$ of $\xi$ is

$$ P_{s,t}(x,\cdot) = \mathcal{N}\Big( x\, e^{-\gamma (t-s)} + \int_0^{t-s} e^{-\gamma v} S(t-v)\,dv \;,\; \sigma^2\, \frac{1 - e^{-2\gamma (t-s)}}{2\gamma} \Big) . $$

Extending $S(\cdot)$ to a $T$-periodic function defined on the whole real axis, we define

$$ M(r) = \int_0^\infty e^{-\gamma v} S(r-v)\,dv = \frac{1}{1 - e^{-\gamma T}} \int_0^T e^{-\gamma v} S(r-v)\,dv , \quad r \ge 0 $$

which is $T$-periodic. We have weak convergence

$$ P_{0,kT}(x,\cdot) \longrightarrow \mathcal{N}\Big( M(0) , \frac{\sigma^2}{2\gamma} \Big) =: \mu , \quad k \to \infty \tag{32} $$

for arbitrary $x \in \mathbb{R}$, and similarly for fixed $0 < s < T$

$$ P_{0,kT+s}(x,\cdot) \longrightarrow \mathcal{N}\Big( M(s) , \frac{\sigma^2}{2\gamma} \Big) , \quad k \to \infty . $$

It is easy to see that the measure $\mu$ defined in (32) is invariant for the chain $(\xi_{kT})_{k \in \mathbb{N}_0}$, and that sets of
positive Lebesgue measure (hence sets of positive $\mu$-measure) are visited infinitely often by $(\xi_{kT})_{k \in \mathbb{N}_0}$,
for every choice of a starting point. Hence $(\xi_{kT})_{k \in \mathbb{N}_0}$ is positive Harris with invariant probability $\mu$.
Hence all conditions of theorem 2.1 are satisfied. The last convergence gives

$$ \mu P_{0,s} = \mathcal{N}\Big( M(s) , \frac{\sigma^2}{2\gamma} \Big) , \quad 0 \le s \le T , $$

thus all measures occurring in 2.1 or 2.2 are known explicitly. The mapping $s \to \mu P_{0,s}$ describes an
'oscillating stationary regime'. □
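The 'oscillating stationary regime' of example 2.3 can be checked numerically. The following is only a sketch under illustrative assumptions: the values of `T`, `GAMMA`, `SIGMA` and the indicator-type signal `S` are hypothetical choices, not taken from the text. It evaluates $M(r)$ by a midpoint rule and verifies that starting the mean in $M(0)$ and propagating it over time $s$ with the Ornstein-Uhlenbeck mean formula reproduces $M(s)$, and that the variance $\sigma^2/(2\gamma)$ is a fixed point of the variance recursion.

```python
import math

# Illustrative (hypothetical) parameter values, not taken from the paper.
T, GAMMA, SIGMA = 1.0, 0.8, 0.5


def S(t):
    """Hypothetical T-periodic, piecewise continuous signal, extended to the whole real axis."""
    return 1.0 if (t % T) < 0.3 else 0.0


def periodic_mean(r, steps=4000):
    """M(r) = (1 - e^{-gamma T})^{-1} * int_0^T e^{-gamma v} S(r - v) dv, by the midpoint rule."""
    dv = T / steps
    total = sum(
        math.exp(-GAMMA * (i + 0.5) * dv) * S(r - (i + 0.5) * dv) for i in range(steps)
    )
    return total * dv / (1.0 - math.exp(-GAMMA * T))


def propagate_mean(m0, s, steps=4000):
    """Mean of xi_s when xi_0 has mean m0: m0 e^{-gamma s} + int_0^s e^{-gamma (s-u)} S(u) du."""
    du = s / steps
    integral = sum(
        math.exp(-GAMMA * (s - (i + 0.5) * du)) * S((i + 0.5) * du) for i in range(steps)
    )
    return m0 * math.exp(-GAMMA * s) + integral * du


def propagate_variance(v0, s):
    """Variance of xi_s when xi_0 has variance v0 (independent of the signal S)."""
    decay = math.exp(-2.0 * GAMMA * s)
    return v0 * decay + SIGMA ** 2 * (1.0 - decay) / (2.0 * GAMMA)


if __name__ == "__main__":
    m0 = periodic_mean(0.0)
    for s in (0.25, 0.5, 0.9, T):
        print(s, periodic_mean(s), propagate_mean(m0, s))
```

The two printed columns should agree up to discretisation error, which reflects $\mu P_{0,s} = \mathcal{N}(M(s), \sigma^2/(2\gamma))$ at the level of means.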



3 An exponential inequality 

In this section, we give an exponential inequality for processes of type (1)+(2). Parameter $\vartheta$ and
periodicity $T$ will play no role in this section, and we consider any time dependent diffusion

$$ d\xi_t = b(t,\xi_t)\,dt + \sigma(\xi_t)\,dW_t , \quad t \ge 0 \tag{33} $$

where $(t,x) \to b(t,x)$ is continuous in restriction to segments $[d_n, d_{n+1}[\, \times\, \mathbb{R}$, for some given deterministic
sequence $(d_n)_n$ with $d_0 = 0$ and $d_n \uparrow \infty$, under the assumptions

$$ |b(t,x) - b(t,x')| + |\sigma(x) - \sigma(x')| \le L\,|x - x'| \quad \text{for all } x, x' \in \mathbb{R} \text{ and all } t \ge 0 , \tag{34} $$

$$ |b(t,x)| \le L\,(1 + |x|) \quad \text{for all } x \in \mathbb{R} \text{ and all } t \ge 0 , \tag{35} $$

$$ |\sigma(x)| \le M \quad \text{for all } x \in \mathbb{R} . \tag{36} $$

Then a strong solution exists for (33) - we construct it successively on $[d_n, d_{n+1}]$ as in Karatzas
and Shreve [KS 91, theorems 5.2.9+5.2.13], taking for $n \ge 1$ the terminal value of the preceding
step as starting value for the following one - and is pathwise unique. Clearly this setting includes
as a special case the equation of type (1)+(2) if we put $b(t,x) := S(\vartheta,t) + b(x)$, under Lipschitz conditions on
$b(\cdot)$ and $\sigma(\cdot)$, if $\sigma(\cdot)$ is bounded. Up to a minor modification which allows for time dependence in
the drift, the following result is due to Brandt ([B 05], Lemma 2.2.4) who extended the classical
Bernstein inequality for local martingales (see formula (1.5) in Dzhaparidze and van Zanten [DvZ 01])
to solutions of SDE.



3.1 Lemma: (Brandt [B 05]) Fix < A < § and \ < rj < 1-A. For (£t)t>o of (JMI)-® there is 
some $\Delta_0 > 0$ (depending only on $L$ of (34)+(35), on $\lambda$ and on $\eta$) such that



holds for all $0 \le t_1 < \infty$ and all $0 < \Delta < \Delta_0$, with positive constants $c_1$ and $c_2$ which do not depend
on $t_1 \ge 0$ or on $\Delta \in (0, \Delta_0)$.

Proof: The proof is from [B 05]. We start for 0<£i<t<ti+A from 



(37) 





Applying to $|b(s,\xi_s)| \le |b(s,\xi_{t_1})| + |b(s,\xi_s) - b(s,\xi_{t_1})|$ the conditions (34)+(35), this gives




[tl<t<£l+A Jt x J Jt 

With Gronwall pathwise in ui we obtain (e.g. Bass [B 98, lemma 1.3.3]) 




sup | / 

t!<t<tl+A Jt! 




for < ti < t < ti + A. Hence 





sup 

ti<i<*i+A 



sup 
Exploiting assumption (36), the classical Bernstein inequality for continuous local martingales (formula
(1.5) in [DvZ 01]) gives

with positive constants $c_1$ and $c_2$ which do not depend on $t_1$ or on $\Delta$. We have also



P sup 

\ti<t<ti+A Jti 



z :- 



A x e- LA - LA- LA 1 ' 11 ] = \A x (e- LA - LA 1 - 11 ^ - LA 1 



> -A x 
~ 2 



provided $\Delta$ is sufficiently small, since $1 - \eta - \lambda > 0$ by assumption. The assertion follows.



□ 



4 A result on finite-dimensional convergence 

In this section, we prove two limit theorems (4.1, 4.3, with remark 4.4 below) which will allow us to work
with log-likelihoods in local models at $\vartheta$. For fixed value of the parameter $\vartheta$ which is suppressed from
notation, we consider the diffusion $\xi = (\xi_t)_{t \ge 0}$ of (1)

$$ d\xi_t = [\,S(t) + b(\xi_t)\,]\,dt + \sigma(\xi_t)\,dW_t $$

where $S(\cdot)$ is some deterministic function with the property

$$ S : [0,\infty) \to \mathbb{R} \ \text{ is } T\text{-periodic and piecewise continuous} . \tag{38} $$

This covers (2). We write $b(t,x) = S(t) + b(x)$ as in (33), assume (34)+(35), and strengthen (36) to

$$ \frac{1}{M} \le |\sigma(x)| \le M \quad \text{for all } x \in \mathbb{R} \tag{39} $$

for some large $M$. Systematically, we combine an appropriate control of fluctuations in the process
$(\xi_t)_t$ (by lemma 3.1) with strong laws of large numbers for additive functionals of the chain of
$T$-segments in $\xi$ (by theorem 2.1), and consider sequences of auxiliary martingales to which we apply
the martingale limit theorem.

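4.1 Theorem: For $\xi$ as above, assume that the embedded chain $(\xi_{kT})_k$ is positive Harris recurrent
with invariant probability $\mu$. Then we have for $0 < r < T$ fixed and arbitrary $h > 0$

$$ \int_0^{nT} \frac{1}{\sigma^2(\xi_s)}\, 1_{(r,\, r + \frac{h}{n})}(i_T(s))\,ds \;=\; h\, \frac{1}{n} \sum_{j=0}^{n-1} \frac{1}{\sigma^2(\xi_{jT+r})} + o_{P_x}(1) \tag{40} $$

$$ \int_0^{nT} \frac{1}{\sigma^2(\xi_s)}\, 1_{(r - \frac{h}{n},\, r)}(i_T(s))\,ds \;=\; h\, \frac{1}{n} \sum_{j=0}^{n-1} \frac{1}{\sigma^2(\xi_{jT+r})} + o_{P_x}(1) \tag{41} $$

as $n \to \infty$, for all $x \in \mathbb{R}$, with leading term

$$ \frac{1}{n} \sum_{j=0}^{n-1} \frac{1}{\sigma^2(\xi_{jT+r})} \;\longrightarrow\; (\mu P_{0,r})\Big( \frac{1}{\sigma^2} \Big) $$

almost surely as $n \to \infty$, as in example 2.2 a).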

Proof: The proof of the approximations (40)+(41) is in several steps. We take $n$ large enough to
have $r - \frac{h}{n}$, $r + \frac{h}{n}$ in $(0,T)$, and fix $\lambda$ and $\eta$ as in lemma 3.1.

1) With arbitrary constants $K$, the following auxiliary result will be needed frequently:

$$ \frac{1}{n} \sum_{j=0}^{n-1} 1\Big\{ \sup_{0 \le s \le T} |\xi_{jT+s}| > K\, n^{\eta} \Big\} \;\longrightarrow\; 0 \quad \text{almost surely as } n \to \infty . \tag{42} $$

This is easily derived from the strong law of large numbers in the chain $X = (X_k)_{k \in \mathbb{N}_0}$ of $T$-segments
in the process $\xi$: for any fixed $c < \infty$, theorem 2.1 b) with $F(\alpha) = \sup_{0 \le s \le T} |\alpha(s)|$ (which is a continuous
function on $C_T$) gives

. n— 1 . n— 1 

-JZ 1 ! "mp feT +s |>c} = lEWkM — »• ^({ae^ T : sup \a(s)\ > c}) 

almost surely as $n \to \infty$. Since the limit on the right hand side decreases to $0$ as $c \uparrow \infty$ and since $n^{\eta}$
exceeds any fixed level as $n$ tends to $\infty$, (42) is proved.

2) Next we prove for arbitrary $h > 0$ and arbitrary starting point $x$ the approximation

i ?J i r^i 

Put $\Delta_n := \frac{h}{n}$. Since by assumption $\sigma(\cdot)$ is Lipschitz and bounded away from $0$ and $\infty$, the $j$-th
summand contributing to the difference in (43)



1 1 



J n 

admits for every j = 0, 1, . . . , ri—1 fixed a bound of type 



dl ' l { -p &t + .i > 

0<s<T \"«/ 



jT+r-A< t <j T +r 

+ ^A^-l { sup \tt-t JT+r _ h \ < A*} 

^ T + T '-^< t <^ T + T ' ™ 
with suitable constants $d_1$, $d_2$ (where $d_2$ involves the Lipschitz constant $L$). By the first type of bound
combined with step 1, we see that

n-l 



-E 

3=0 



1 



1 



{ sup \£ jT+s \ > (3M } 



vanishes almost surely as $n \to \infty$. Next, the exponential inequality in lemma 3.1 (applied to $t_1 =
jT + r - \frac{h}{n}$ for $j = 0, 1, \ldots, n-1$) implies that



sup 



\£t- £ jT+r _k\ > A n > l^-T+r-^l - (^~) ^ some J = 0, 1, . . . , tt,— 1 ^ 



vanishes as $n \to \infty$. Hence, by the second type of bound, the probability to find any strictly positive
summand in
n-l 



1 



1 



0" 2 (£jT+r) cr 2 (£ jT+r _h] 



]T + r-%<t<jT + r 

tends to $0$ as $n \to \infty$: hence this sum vanishes in probability as $n \to \infty$. Finally, by the third type of
bounds, we are left to consider averages

n—l 



n ^ 

3=0 



1 



J n. 



1 



{ sup \St-t jT+r _h\ < A*} 



3T+r-£<t<jT+r 



which are bounded by $d_2 \Delta_n^{\lambda}$, and thus vanish as $n \to \infty$. We have proved (43).
3) Next we show that for arbitrary $h > 0$ and arbitrary starting point $x$



(44) 



E ^ ^-^ 



j=0 



jT+r 



-A CF 2 (£ S 



1 1 



as $n \to \infty$. The proof of (44) follows the same scheme as the proof of (43): we consider for $j =
0, 1, \ldots, n-1$ summands

r jT+r 



(45) 



1 



ds 



and have for these - since $\sigma(\cdot)$ is Lipschitz and bounded away from $0$ and $\infty$ - bounds of type



n I sup \ZjT+ s \ > [-£-) } 



1 



+ n dl ' \ sup 16-^2,1 > A* , | W _ # | < (£)"} 

J'T+r- £<t<jT+r 



1 



+ - d 2 A* • 1 



n ' M sup ICt-^- T+r _il < A*} 



jT+r-£<t<jT+r 
which allow us to proceed in complete analogy to step 2) above to establish (44).
4) Combining (43)+(44), we have proved (41). The proof of (40) is similar.



□ 
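The approximation (41) can be illustrated by a small Euler-type simulation. This is only a sketch under hypothetical choices that are not taken from the text: the diffusion coefficient `sigma`, the drift (a periodic indicator signal minus a linear restoring term), $T = 1$, $r = 1/2$, $h = 1$ and the discretisation are all assumptions made for illustration. The function returns the left hand side of (41), accumulated over the windows $(r - h/n, r)$ of each period, and the leading term on the right hand side.

```python
import math
import random


def sigma(x):
    """Hypothetical diffusion coefficient: Lipschitz and bounded between 1 and 1.5."""
    return 1.0 + 0.5 / (1.0 + x * x)


def drift(x, phase):
    """Hypothetical drift S(t) + b(x): periodic indicator signal plus linear restoring force."""
    signal = 1.0 if phase < 0.3 else 0.0
    return signal - x


def compare_sides(n=100, steps_per_period=1000, r_index=500, window=10, seed=1):
    """Return (lhs, rhs) of (41) along one Euler path, with T = 1 and window length h/n."""
    rng = random.Random(seed)
    dt = 1.0 / steps_per_period
    h = n * window * dt  # window * dt is the window length h / n
    x = 0.0
    lhs = 0.0
    rhs_sum = 0.0
    for _period in range(n):
        for i in range(steps_per_period):
            # x is the state at time jT + i * dt before the update below
            if r_index - window <= i < r_index:
                lhs += dt / sigma(x) ** 2
            if i == r_index:
                rhs_sum += 1.0 / sigma(x) ** 2
            x += drift(x, i * dt) * dt + sigma(x) * math.sqrt(dt) * rng.gauss(0.0, 1.0)
    return lhs, h * rhs_sum / n


if __name__ == "__main__":
    print(compare_sides())
```

Both returned values should be close to each other; increasing `n` (with `window` fixed) shrinks the window length $h/n$, in line with the $o_{P_x}(1)$ remainder in (41).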



4.2 Corollary: For $\xi$ as above, assume that the embedded chain $(\xi_{kT})_k$ is positive Harris recurrent
with invariant probability $\mu$. Then we have for $0 < r < T$ fixed and arbitrary $h > 0$



nt 



1 



almost surely as n — > oo, for every t > fixed. 



| /l (^P ,r)(^) 



Proof: This is a variant of the preceding proof: for $t > 0$ fixed and $n \to \infty$, we consider a sequence
$(m_n)_n$ such that $m_n \le \frac{nt}{T} < m_n + 1$, and copy the proof of (41) to obtain



1 



1 



m„ — 1 



n ^ 



+ op«(1) 



almost surely as $n \to \infty$, for arbitrary $x \in \mathbb{R}$. Then thanks to theorem 2.1, the assertion follows as
in example 2.2 b). □



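In the following, we view the process $\xi$ of (1)+(38) again as canonical process on the canonical path
space $(C, \mathcal{C})$, and write $(\eta_t)_{t \ge 0}$ as in the beginning of section 1. For $0 < r < T$ and $n$ large enough,
define martingales $(Y^{n,r,h}_t)_{t \ge 0}$ with respect to $\mathbb{G}^n$

$$ \mathbb{G}^n = (\mathcal{G}_{nt})_{t \ge 0} , \quad \mathcal{G}_t := \bigcap_{\varepsilon > 0} \sigma\big( \eta_s : 0 \le s \le t + \varepsilon \big) $$

by

$$ Y^{n,r,h}_t := \begin{cases} \displaystyle \int_0^{tn} \frac{1}{\sigma(\eta_s)}\, 1_{(r,\, r + \frac{h}{n})}(i_T(s))\,dB_s & \text{if } h > 0 \\[2ex] \displaystyle \int_0^{tn} \frac{1}{\sigma(\eta_s)}\, 1_{(r - \frac{|h|}{n},\, r)}(i_T(s))\,dB_s & \text{if } h < 0 . \end{cases} $$

Here $n$ large enough means $\frac{|h|}{n} < \min(r, T - r)$. Fix a set of points $0 = r_0 < r_1 < \ldots < r_\ell < r_{\ell+1} = T$
and a set of points $-\infty < h_1 < h_2 < \ldots < h_m < \infty$ in $\mathbb{R} \setminus \{0\}$. For $n$ large enough (such that
$\frac{1}{n} \max_{k=1,\ldots,m} |h_k| < \min_{j=0,\ldots,\ell} (r_{j+1} - r_j)$) compose $\mathbb{G}^n$-martingales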



JR m -valued 



Y" 



t>o 



/ Y™' 1 \ 



\ * / t>0 



JR £m -valued 
4.3 Theorem: a) For $j = 1, \ldots, \ell$, we have weak convergence in $\mathbb{R}^m$ as $n \to \infty$



N ( , ATj) , r j : =( M P ^)(^) 



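where $\Lambda = (\Lambda_{i,i'})_{1 \le i, i' \le m}$ is the matrix with entries

$$ \Lambda_{i,i'} := \begin{cases} h_i \wedge h_{i'} & \text{if } h_i > 0 \text{ and } h_{i'} > 0 \\ |h_i| \wedge |h_{i'}| & \text{if } h_i < 0 \text{ and } h_{i'} < 0 \\ 0 & \text{else} . \end{cases} $$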



b) We have weak convergence in $\mathbb{R}^{\ell m}$ as $n \to \infty$

-> AA(0,D) , 



Yj. 







V 



AT^ y 

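Proof: By definition of the $\mathbb{G}^n$-martingales $Y^{n,r_j,h_i}$ or $Y^{n,r_j,h_{i'}}$, we have for $t \ge 0$

$$ \big\langle Y^{n,r_j,h_i} , Y^{n,r_j,h_{i'}} \big\rangle_t = \int_0^{tn} \frac{1}{\sigma^2(\eta_s)}\, 1_{(r_j,\, r_j + \frac{h_i \wedge h_{i'}}{n})}(i_T(s))\,ds $$

if $h_i > 0$ and $h_{i'} > 0$, and

$$ \big\langle Y^{n,r_j,h_i} , Y^{n,r_j,h_{i'}} \big\rangle_t = \int_0^{tn} \frac{1}{\sigma^2(\eta_s)}\, 1_{(r_j - \frac{|h_i| \wedge |h_{i'}|}{n},\, r_j)}(i_T(s))\,ds $$

if $h_i < 0$ and $h_{i'} < 0$, and in all other cases

$$ \big\langle Y^{n,r_j,h_i} , Y^{n,r_j,h_{i'}} \big\rangle_t = 0 . $$

By corollary 4.2, we have almost surely as $n \to \infty$ for fixed $t$

$$ \big\langle Y^{n,r_j,h_i} , Y^{n,r_j,h_{i'}} \big\rangle_t \;\longrightarrow\; \begin{cases} \frac{t}{T}\, (h_i \wedge h_{i'})\, \Gamma_j & \text{if } h_i > 0 \text{ and } h_{i'} > 0 \\ \frac{t}{T}\, (|h_i| \wedge |h_{i'}|)\, \Gamma_j & \text{if } h_i < 0 \text{ and } h_{i'} < 0 \\ 0 & \text{else} \end{cases} $$

with $\Gamma_j$ as above. In virtue of the martingale convergence theorem (Jacod and Shiryaev 1987,
VIII.3.22), this implies weak convergence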



I \V2 

— AT,- 



V Jt>o 

in the Skorohod path space of cadlag functions $[0,\infty) \to \mathbb{R}^m$, where $B^j$ is $m$-dimensional Brownian
motion and $\Lambda$ as defined above. This implies a). For $j \ne j'$ choose $B^j$, $B^{j'}$ independent, and compose



V W J 



M Em -vdlued 
Then the martingale convergence theorem gives weak convergence as $n \to \infty$

1 \ V2 



( Y D 



t>0 



T 

in the Skorohod path space of cadlag functions $[0,\infty) \to \mathbb{R}^{\ell m}$, with $D$ as defined above, since

$$ \big\langle Y^{n,r_j,h_i} , Y^{n,r_{j'},h_{i'}} \big\rangle_t = 0 \quad \text{whenever } j \ne j' . $$

This implies assertion b). □

4.4 Remark: The matrix A defined in theorem 4.3 has the structure of the covariance kernel of 
two-sided one-dimensional Brownian motion. Hence we shall read theorem 4.3 as a result on processes 
which for fixed T are now indexed in the parameter h 

(46) (Y"' ri > h ) , ... , (Y^ h ) 

\ 1 Jhe[-K,K] V 1 Jhe[-K,K] 

and which converge by theorem 4.3 as n — > oo in the sense of finite dimensional distributions to 

(47) <Wh)hz[-K,K\ , ... , rf 2 (w e h ) heW] 

with independent two-sided one-dimensional Brownian motions $W^j = (W^j_h)_{h \in \mathbb{R}}$, $1 \le j \le \ell$. Here the
choice of determined intervals $[-K,K]$ is in fact irrelevant. We shall make use of finite dimensional
convergence (46)+(47) when considering martingale terms in the log-likelihoods of local models at
some fixed point $\vartheta \in \Theta$. It is through the reinterpretation (46)+(47) of theorem 4.3 that the local
parameter $h$ in local models at $\vartheta$ begins to play the role of 'time'. □
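The structure of the matrix $\Lambda$ of theorem 4.3 as covariance kernel of two-sided Brownian motion can be made concrete in a few lines. The sketch below is illustrative only: the points in `POINTS` are hypothetical values of $h_1 < \ldots < h_m$ in $\mathbb{R} \setminus \{0\}$, and the plain Cholesky routine is included merely to confirm numerically that the resulting matrix is positive definite, as a covariance matrix of distinct nonzero time points must be.

```python
import math


def two_sided_bm_cov(h1, h2):
    """Entry of Lambda: min(|h1|, |h2|) if h1, h2 lie on the same side of 0, else 0."""
    if h1 > 0 and h2 > 0:
        return min(h1, h2)
    if h1 < 0 and h2 < 0:
        return min(-h1, -h2)
    return 0.0


def kernel_matrix(points):
    """Matrix Lambda for a list of distinct nonzero points h_1 < ... < h_m."""
    return [[two_sided_bm_cov(a, b) for b in points] for a in points]


def cholesky(matrix):
    """Plain Cholesky factorisation; raises ValueError if the matrix is not positive definite."""
    n = len(matrix)
    lower = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(i + 1):
            partial = sum(lower[i][k] * lower[j][k] for k in range(j))
            if i == j:
                pivot = matrix[i][i] - partial
                if pivot <= 0.0:
                    raise ValueError("matrix is not positive definite")
                lower[i][j] = math.sqrt(pivot)
            else:
                lower[i][j] = (matrix[i][j] - partial) / lower[j][j]
    return lower


# Hypothetical choice of points h_1 < ... < h_m, for illustration only.
POINTS = [-2.0, -0.5, 0.3, 1.0, 2.5]
LAMBDA = kernel_matrix(POINTS)
FACTOR = cholesky(LAMBDA)
```

The matrix is block diagonal: the block for negative points and the block for positive points are each covariance matrices of a one-sided Brownian motion, and the zero off-diagonal blocks express independence of the two branches.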



5 Proofs for the statistical results stated in section 1 

In order to prove the results stated in section 1, we consider the process $\xi$ defined by (1)+(2) which
depends on the parameter $\vartheta \in \Theta$. We assume the Harris condition (H1) for all values of $\vartheta \in \Theta$, and
Lipschitz and linear growth conditions on $b(\cdot)$ and $\sigma(\cdot)$ in combination with (H2). This implies that
for all $\vartheta$, all assumptions made in section 2 hold, as well as those of section 3 and of section 4. As in
section 1, $Q^{\vartheta}$ is the law of $\xi$ under $\vartheta$ on the canonical path space $(C, \mathcal{C})$, $\eta$ the canonical process on
$(C, \mathcal{C})$, and $\mathbb{G}$ the canonical filtration on $(C, \mathcal{C})$.
We restart from the representation of likelihood ratios $L^{\zeta'/\zeta}_t$ of $Q^{\zeta'}|\mathcal{G}_t$ with respect to $Q^{\zeta}|\mathcal{G}_t$ given
before theorem 1.1: for $\zeta \in \Theta$ and $\zeta'$ close to $\zeta$, $L^{\zeta'/\zeta}_t$ equals



(48) 



exp 



+ 



o a 



t / \ * 



A*( S )) 



2 7o \^U)J l ^ {lT{s))ds 
\*{s) 1 /"* / \*(s))\ 2 



in case $\zeta < \zeta' < \zeta + a$, with $B$ Brownian motion under the reference point $\zeta$, and



(49) 



exp 



A* 00 



l {c , (i T (s))dB s - ^ < (^) 2 1 V, (ir(-))* 



' -7-T l(C'+o,C+o) (*t(*)) 
°WsJ 



' / A*(s))\ 

l( f /+ , f+0 )(»r(*))<fa 



in case C' < C < C + a - We state a lemma related to the geometry - in Hellinger sense - of the model 
determined bv (p]) + (|i9l) . 



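5.1 Lemma: Consider $\zeta, \zeta' \in \Theta$ such that $\zeta < \zeta' < \zeta + a$ or $\zeta' < \zeta < \zeta' + a$ holds. Then there are
positive constants $c$, $c_j$ and $k$ not depending on $\zeta$, $\zeta'$, $t$ such that for all $t \ge 0$

$$ E_\zeta\Big( \sup_{0 \le s \le t} L^{\zeta'/\zeta}_s \Big) \;\le\; c + c_1 \Big( \Big\lfloor \frac{t}{T} \Big\rfloor + 1 \Big)\, |\zeta' - \zeta| \tag{50} $$

$$ E_\zeta\Big( \Big[ 1 - \big( L^{\zeta'/\zeta}_t \big)^{1/2} \Big]^2 \Big) \;\le\; \sum_{j=1}^{3} c_j \Big( \Big\lfloor \frac{t}{T} \Big\rfloor + 1 \Big)^j |\zeta' - \zeta|^j , \tag{51} $$

$$ E_\zeta\Big( \Big[ 1 - \big( L^{\zeta'/\zeta}_t \big)^{1/4} \Big]^4 \Big) \;\le\; \sum_{j=2}^{5} c_j \Big( \Big\lfloor \frac{t}{T} \Big\rfloor + 1 \Big)^j |\zeta' - \zeta|^j , \tag{52} $$

$$ E_\zeta\Big( \big( L^{\zeta'/\zeta}_t \big)^{1/2} \Big) \;\le\; \exp\Big\{ - k\, \Big\lfloor \frac{t}{T} \Big\rfloor\, |\zeta' - \zeta| \Big\} . \tag{53} $$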



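Proof: 1) For $\zeta, \zeta'$ in $\Theta$, the likelihood ratio process $(L^{\zeta'/\zeta}_t)_{t \ge 0}$ under $Q^{\zeta}$ is the exponential

$$ L^{\zeta'/\zeta} = \mathcal{E}\Big( \int \delta_s\,dB_s \Big) \quad \text{with} \quad \delta_s := \frac{S(\zeta',s) - S(\zeta,s)}{\sigma(\eta_s)} . \tag{54} $$

Hence $(L^{\zeta'/\zeta}_t)_{t \ge 0}$ under $Q^{\zeta}$ solves the SDE

$$ L^{\zeta'/\zeta}_t = 1 + \int_0^t L^{\zeta'/\zeta}_s\, \delta_s\,dB_s , \quad t \ge 0 $$

from which one obtains by Ito formula under $Q^{\zeta}$

$$ V_t := \big( L^{\zeta'/\zeta}_t \big)^{1/4} \ \text{ satisfies } \ V_t = 1 - \frac{3}{32} \int_0^t V_s\, \delta_s^2\,ds + \frac{1}{4} \int_0^t V_s\, \delta_s\,dB_s , \tag{55} $$

$$ V_t := \big( L^{\zeta'/\zeta}_t \big)^{1/2} \ \text{ satisfies } \ V_t = 1 - \frac{1}{8} \int_0^t V_s\, \delta_s^2\,ds + \frac{1}{2} \int_0^t V_s\, \delta_s\,dB_s . \tag{56} $$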

2) From now on in this proof, we fix £ G and consider Q' such that either (<('<( + a or 
C' < C < C' + a holds. In the first case we have according to (|48p 

A*(s) 

:= ^) [ _1 (C'C) + 1 (C+a.C'+a)] M«)) 

with non-overlapping intervals, in the second case by (|49p 



Let s — > denote the deterministic T-periodic function 

d(s) := l{5^o}(«) = [l(C',C) + hc'+^+a)] (iris)) , s > . 

By our assumptions on A*(-) and cr(-), the process (^^y)s>o is bounded away from both and oo. 
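Hence there are some $0 < \underline{c} < \overline{c} < \infty$ not depending on $\zeta$, $\zeta'$, $t$ such that

$$ \underline{c}\, d(s) \;\le\; \delta_s^2 \;\le\; \overline{c}\, d(s) , \qquad 2 \Big\lfloor \frac{t}{T} \Big\rfloor\, |\zeta' - \zeta| \;\le\; \int_0^t d(s)\,ds \;\le\; 2 \Big( \Big\lfloor \frac{t}{T} \Big\rfloor + 1 \Big)\, |\zeta' - \zeta| \tag{57} $$

for all $t \ge 0$; here $\lfloor x \rfloor$ denotes the biggest integer strictly smaller than $x$.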



3) We prove ([50]) . The process Vt := yL^ J of (f56|) is nonnegative and satisfies E^(V S ) = 1 for 
all s. Thus (f5T)j) gives 



1 /■* 

< ^ < 1 + - / V a 5 s dB a 
1 Jo 

and allows to write (with [IW 89, p. 110]) 

2 N 



E c ( sup V t 2 J < c ( 1 + E c ( sup ( / V r 

\0<s<t J V \0<s<< V-A) 



r 5 r (ii? r 



ft 

< c' ( 1 + E c ( / y s 2 ^ 







" (l + E{ (J^ V 2 d(s) ds\ \ < c(l + Jd(s)da\ 



< c 

which combined with ([57]) gives ([50]). 

4) We prove (|5ip . We start from := 7 15 J for which (|56p yields the bound 
1 /■* 1 /■* / I 



•f/o 1/2 



■/o 8 jo \ z Jo 



(58) - - / Ktf.dB, < l-V t < - J V s 8 z s ds + ( - I V s 5 s dB, 

25 



where / denotes the negative part in / = / + — / . Combining (|50p + (|57p we have 



n 2 N 



V, 6 2 ds 



< SUP Vg 

\Q<s<t 



-i 2 



c / d(s) ds 



< C 2 



+ i)Y-ci 2 + c 3 (^J+i) 3 |C'-CI 



for suitable 02,03. For the martingale term in (j58|) . using again ([57|) and E^(V 2 ) = 1, we have 
E, I / V, 5, dB, 



E ( ^V 2 5 2 ds^ <cj*d(s)ds < Cl ( l - +l)|C'-CI 



for all $t \ge 0$. Squaring the bound (58), both inequalities together give (51).



5) To prove ([52]) . we define Vt := yL t 
bound of form (1581) . We use 



C7C 



1/4 



C 



v, ds 



and have from ([55]) up to different constants again a 



t -|4 

c / d(s) ds 




< £? f sup y? 

V0<s<t 



and with [IW 89, p. 110] 



V s S s dB s 



cE, 



C 



V 2 <5? ds 



< cEr I sup Vg 
\()<s<t 



c / d(s) ds 



Applying (|50p to the first factor and (|57p to the second factor appearing on the right hand side, the 
sum of the right hand sides of both last inequalities is at most 

5 



£<*( | +1) ic'-cp 

j=2 VL J ' 



6) To prove (|53|) . we deduce from the exponential representation (|54|) under that 



C7C 



1/2 



/ (n S s) dB s exp 



<5„ ds 



So the lower bound in (f57|) applies and shows 

£ ? f (-^f /C ) 1/2 ) < exp I - ^ c y d{s) ds I < exp | - k 
for suitable & > 0. We have proved all assertions of lemma 5.1. 



IC'-Cl} 



□ 



From (51) in lemma 5.1, we have bounds for the squared Hellinger distance when we observe the
trajectory of $(\xi_t)_{t \ge 0}$ up to time $nT$: for $\zeta'$ sufficiently close to $\zeta$

$$ H^2\big( Q^{\zeta}|\mathcal{G}_{nT} \,,\, Q^{\zeta'}|\mathcal{G}_{nT} \big) \;=\; \frac{1}{2}\, E_\zeta\Big( \Big[ 1 - \big( L^{\zeta'/\zeta}_{nT} \big)^{1/2} \Big]^2 \Big) \;\le\; C \sum_{j=1}^{3} n^j\, |\zeta' - \zeta|^j \tag{59} $$

for all $n \ge 1$, with some constant $C$ which does not depend on $\zeta, \zeta', n$. In particular, the
parametrization in $\{ Q^{\zeta}|\mathcal{G}_{nT} : \zeta \in \Theta \}$ is - in Hellinger distance - Hölder continuous with index
$\frac{1}{2}$. In local models at some fixed reference point $\vartheta$ in the parameter space, the right hand side of
(59) forces local scale to be proportional to $\frac{1}{n}$. This leads to local models at $\vartheta$ as considered in 1.1.



5.2 Proof of theorem 1.1: a) From (48)+(49) we have for $n$ sufficiently large



(60) 



L 



nT 



exp 



A*(«) 
A* 00) 



1 ( 1 ?+a,tf+<H-£) V,^) 



(i T {s))dB s 
(i T (s))ds 



in case it = /i > 0, where all intervals are disjoint, and in case u = — h < 



(61) 



TlT 



exp 



7(T 



A'QQ 
o-fe) 

A'QQ) 



1 V+a-itf+a) 



+ 1, 



(i T (s))dB s 
{iris)) ds 



Since the function $\lambda^*(\cdot)$ is deterministic, $T$-periodic and continuous, since the intervals occurring in
the integrands are of length $O(\frac{1}{n})$, the logarithm of both expressions (60)+(61) behaves as $n \to \infty$ as

1 



(62) 



M, 



n,D,h 



-(M 



n,0,h\ 



sgn(/i) ( A*(tf + a)y/^ +a ' h - A*(tf)y™ A/l 



with notations of theorem 4.3, for every fixed value of h. Finite-dimensional convergence as n — > oo 



AC 



he[-K,K] 



4 /2 -(w h 



he[-K,K] 



W(uJ#) 



ue[-K,K] 



follows from theorem 4.3 and remark 4.4, with (VF M ) ng jR two-sided standard Brownian motion and 

J, = {(AW^X^ + tA*^^ 
as in theorem 1.1. With similar arguments, convergence of angle brackets in (62) is immediate from
theorem 4.1. The actual choice of $[-K,K]$ above being irrelevant for finite-dimensional convergence,
we have proved convergence as $n \to \infty$ of the finite dimensional distributions for likelihood ratios in
local models at $\vartheta$



L 



nT 



under , @# n := {u £ M : + - G 6} 



to the finite dimensional distributions of likelihood ratios (5)+(6) in the limit model $\mathcal{E}$

$$ \big( L^{u/0} \big)_{u \in \mathbb{R}} \ \text{ under } P_0 , \qquad L^{u/0} := \exp\Big\{ W(u J_\vartheta) - \frac{1}{2}\, |u J_\vartheta| \Big\} $$

as asserted in theorem 1.1 a).

b) Convergence of the local models at $\vartheta$ in the sense of [S 85, p. 302] as $n \to \infty$ to the limit model $\mathcal{E}$
reduces to finite dimensional convergence of likelihoods as proved in a) since all probability measures
in $\{ Q^{\vartheta + h/n}|\mathcal{G}_{nT} : h \in \Theta_{\vartheta,n} \}$ and in $\{ P_h : h \in \mathbb{R} \}$ are equivalent. □

We remark that the same scheme of proof based on theorem 4.3 allows us to consider signals of type (2)
having finitely many discontinuities which arise at epochs $0 < \vartheta_1 < \vartheta_2 < \ldots < \vartheta_d < T$, cf. remark
1.3 for $d = 2$, and leads to a $d$-dimensional parameter $\vartheta = (\vartheta_1, \ldots, \vartheta_d)$ and limits of local models at
$\vartheta$ which are $d$-fold products - with suitable scaling in the factor models - of independent experiments $\mathcal{E}$.

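The next two lemmata deal with the limit experiment $\mathcal{E}$ in (5)+(6).

5.3 Lemma: In $\mathcal{E}$, for every $u \in \mathbb{R}$ fixed, there is a two-sided $P_u$-Brownian motion $\widetilde{W}$ such that

$$ L^{(u+h)/u} = \exp\Big\{ \widetilde{W}(h J_\vartheta) - \frac{1}{2}\, |h J_\vartheta| \Big\} \quad \text{for all } h \in \mathbb{R} . \tag{63} $$

Hence for every $u \in \mathbb{R}$, $\{ P_{u+h} : h \in \mathbb{R} \}$ is statistically the same experiment as $\mathcal{E}$.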



Proof: We will consider the likelihood ratio L u / U ° separately in all possible cases < uq < u, 
< u < no, uo<0<u, u<0< uo, .... We give the detailed proof in case u < < uq, 
using notation (|7|) of remark 1.2, and suppressing the scaling factor as there. Write (t/ 1 ) , r/ 2 ) ) for the 
canonical process on C([0, 00), M 2 ). By ([7]), the canonical process has semimartingale characteristics 




under P u with u < 



under P uo with uq > 



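hence the likelihood ratio process of $P_u$ to $P_{u_0}$ relative to the canonical filtration on the path space
$C([0,\infty), \mathbb{R}^2)$ is given by

$$ \exp\Big\{ \Big( - \int_0^t 1_{(0,u_0)}(s)\,d\widetilde{m}^{(1)}_s + \int_0^t 1_{(0,|u|)}(s)\,d\widetilde{m}^{(2)}_s \Big) - \frac{1}{2}\,(u_0 \wedge t) - \frac{1}{2}\,(|u| \wedge t) \Big\} , \quad t \ge 0 $$

where $\widetilde{m}^{(i)}$ denotes the martingale part of $\eta^{(i)}$ under $P_{u_0}$. $\widetilde{m}^{(1)}$ and $\widetilde{m}^{(2)}$ being independent $P_{u_0}$-
Brownian motions by (7), we form a two-sided $P_{u_0}$-Brownian motion $M$ using $\widetilde{m}^{(1)}$ on the right and
$\widetilde{m}^{(2)}$ on the left branch. Letting $t \to \infty$ in the above likelihood, we get

$$ L^{u/u_0} = \exp\Big\{ \big( \widetilde{m}^{(2)}_{|u|} - \widetilde{m}^{(1)}_{u_0} \big) - \frac{1}{2}\, \big( |u| + u_0 \big) \Big\} = \exp\Big\{ ( M_u - M_{u_0} ) - \frac{1}{2}\, |u - u_0| \Big\} $$

since $u < 0 < u_0$. Also $\widetilde{W}$ defined by $\widetilde{W}(h) := M_{u_0 + h} - M_{u_0}$, $h \in \mathbb{R}$, is a two-sided $P_{u_0}$-Brownian
motion, and the last equation takes the form

$$ L^{u/u_0} = \exp\Big\{ \widetilde{W}(u - u_0) - \frac{1}{2}\, |u - u_0| \Big\} \quad \text{if } u < 0 < u_0 . $$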



We have proved (63) in case $u < 0 < u_0$. The remaining cases are proved similarly. Reintroducing
the scaling factor $J_\vartheta$ of (5)+(6) amounts to replacing $u$ and $u_0$ by $u J_\vartheta$ and $u_0 J_\vartheta$. □

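5.4 Lemma: In $\mathcal{E}$, we have for all $u, u'$ in $\mathbb{R}$

$$ E_u\Big( \Big[ 1 - \big( L^{u'/u} \big)^{1/2} \Big]^2 \Big) \;\le\; \frac{J_\vartheta}{4}\, |u' - u| \tag{64} $$

$$ E_u\Big( \Big[ 1 - \big( L^{u'/u} \big)^{1/4} \Big]^4 \Big) \;\le\; c_\vartheta\, |u' - u|^2 \tag{65} $$

$$ E_u\Big( \big( L^{u'/u} \big)^{1/2} \Big) \;\le\; \exp\big\{ - k_\vartheta\, |u' - u| \big\} \tag{66} $$

with constants $c_\vartheta$, $k_\vartheta$ which do not depend on $u$, $u'$.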



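Proof: Again we suppress the scaling factor. We use (63) to write for $u' \ne u$

$$ \big( L^{u'/u} \big)^{1/2} = \exp\Big\{ \frac{1}{2}\, \widetilde{W}(u' - u) - \frac{1}{8}\, |u' - u| \Big\}\; e^{-\frac{1}{8} |u' - u|} . $$

By 5.3, the expectation of the first term on the right hand side under $P_u$ is 1: this proves (66) and

$$ E_u\Big( \Big[ 1 - \big( L^{u'/u} \big)^{1/2} \Big]^2 \Big) = 2 \Big( 1 - e^{-\frac{1}{8} |u' - u|} \Big) $$

and (64). Similar calculations considering

$$ E_u\Big( \big( L^{u'/u} \big)^{j/4} \Big) $$

for $j = 1, 2, 3$ give

$$ E_u\Big( \Big[ 1 - \big( L^{u'/u} \big)^{1/4} \Big]^4 \Big) = 2 + 6\, e^{-\frac{1}{8} |u' - u|} - 8\, e^{-\frac{3}{32} |u' - u|} $$

which behaves as $\mathrm{cst} \cdot |u' - u|^2$ as $u' \to u$. This proves (65).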



□ 
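The closed-form expressions in the proof of lemma 5.4 can be cross-checked by numerical integration. The sketch below suppresses the scaling factor as in the proof, so that $L^{u'/u} = \exp\{\widetilde{W} - \frac{1}{2} d\}$ with $\widetilde{W} \sim \mathcal{N}(0, d)$ and $d = |u' - u|$; the function names, the trapezoidal rule and the grid sizes are choices made here for illustration and are not part of the text.

```python
import math


def moment_of_likelihood(power, dist, grid=20001, width=10.0):
    """E[L^power] for L = exp(W - dist/2), W ~ N(0, dist), by the trapezoidal rule."""
    sd = math.sqrt(dist)
    lo, hi = -width * sd, width * sd
    step = (hi - lo) / (grid - 1)
    total = 0.0
    for i in range(grid):
        w = lo + i * step
        density = math.exp(-w * w / (2.0 * dist)) / math.sqrt(2.0 * math.pi * dist)
        weight = 0.5 if i in (0, grid - 1) else 1.0
        total += weight * math.exp(power * (w - dist / 2.0)) * density * step
    return total


def closed_form_moment(power, dist):
    """E[L^power] = exp(-power (1 - power) dist / 2) for the Gaussian likelihood above."""
    return math.exp(-0.5 * power * (1.0 - power) * dist)


def sqrt_distance(dist):
    """E[(1 - L^{1/2})^2], the quantity bounded in (64), computed numerically."""
    return 1.0 - 2.0 * moment_of_likelihood(0.5, dist) + moment_of_likelihood(1.0, dist)


def fourth_power_distance(dist):
    """E[(1 - L^{1/4})^4], the quantity bounded in (65), via the binomial expansion."""
    return sum(
        math.comb(4, j) * (-1) ** j * moment_of_likelihood(j / 4.0, dist) for j in range(5)
    )


if __name__ == "__main__":
    for d in (0.1, 1.0, 4.0):
        print(d, sqrt_distance(d), 2.0 * (1.0 - math.exp(-d / 8.0)))
```

For small $d$ the first quantity is of order $d$ and the second of order $d^2$, which is the Hölder-type behaviour used in the sequel.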
Since the left hand side of (64) is twice the squared Hellinger distance, the parametrization is Hölder
continuous with index $\frac{1}{2}$

$$ \lim_{u' \to u} \frac{1}{\sqrt{|u' - u|}}\; H( P_{u'} , P_u ) = \sqrt{\frac{J_\vartheta}{8}} \tag{67} $$

at every point $u$ of the limit experiment $\mathcal{E} = \{ P_u : u \in \mathbb{R} \}$ of (5)+(6) for local models at $\vartheta$.



In order to prove theorem 1.4, we shall follow Ibragimov and Khasminskii [IH 81, theorems 5.1+5.2 
in section 1 and 19-21 in appendix A1.4]. 



5.5 Proof of theorem 1.4: Fix i9 G 0; in analogy to [IH 81], we use notations 



Z(u) := L"/° , ueM. 



By (52), there is some $q' \in \mathbb{N}$ and some constant $C$ (both not depending on $\vartheta$ or $n$) such that



(68) 



£^9 



- z n,U u 2) ) < C(i + ir«')K- M2 | 2 if |m|> 2 | <^ 



,1/4, 



holds for alli , TGiV,'f9GO,nG-#V. By theorem 1.1 a), we have convergence as n — > oo 



(69) 



in the sense of finite dimensional distributions. ()53p gives some (not depending on $ or n) such that 



(70) 



^(zJJftO) < e- fc H , uGG 



for all "Q G and n G IV. In the limit experiment £ , we have the corresponding assertions 



(71) 



zV\u x ) - Z l l\u 2 ) 



< C\u\— U2\ 2 , Ui,U2 £ M 



(72) 



from (I65I) + (I66I) in lemma 5.4. 



%(^ l/2 K> < 



■k\u\ 



, u £ M 



1) Fix $K < \infty$ arbitrarily large. In virtue of [IH 81, p. 378], the assertions (68)+(69) imply weak
convergence in $C([-K,K])$ of $Z_{n,\vartheta}^{1/4}$ under $Q^{\vartheta}$ to $Z^{1/4}$ under $P_0$ as $n \to \infty$, and thus weak convergence
in $C([-K,K])$ of $Z_{n,\vartheta}$ under $Q^{\vartheta}$ to $Z$ under $P_0$ as $n \to \infty$. This proves part a) of the theorem.
2) Exactly on the lines of the arguments given in [IH 81, p. 372 and p. 42-44], we deduce from
assertions (68)+(70) above the following: there is some $q \in \mathbb{N}$ and - with respect to any $K_0 > 0$
fixed - suitable constants $b_3, b_2 > 0$ such that



(73) 



sup Z M (u) >e < -j=b 3 (K+r+l)ie- b ^ K+r ^ 

\K+r<\u\<K+r+l J V 

for any choice of $\varepsilon > 0$, $r \in \mathbb{N}_0$, $K \ge K_0$.

The constants $b_3, b_2, q$ in (73) do not depend on $n$ or $\vartheta$ (since they come from the right hand sides in
(68)+(70) and from choice of $K_0$). Similarly, at the level of the limit experiment $\mathcal{E}$, we deduce in the
same way from (71)+(72) above



(74) 



P sup Z(u) >e) < -±b 3 (K+r+l)ie- b2 ( K+r *> 

\K+r<\u\<K+r+l J VS 



for any choice of e > , r £ Wq , K > Kq . 

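3) From (73) with $\varepsilon = 1$, we have for all $\vartheta \in \Theta$ and $n \ge 1$

$$ Q^{\vartheta}\Big( \sup_{|u| > K} Z_{n,\vartheta}(u) > 1 \Big) \;\le\; b_1\, e^{-b_2 K} , \quad K \ge K_0 \tag{75} $$

with constants which do not depend on $n$ or $K \ge K_0$. This is seen similarly to [IH 81, p. 43] after
summation over $r \in \mathbb{N}_0$ in (73) with $\varepsilon = 1$. From (73) where $\varepsilon$ is replaced by $\varepsilon_r := (K + r + 1)^{-p}$ which
depends on $r$, we obtain assertion (8) of theorem 1.4 since

$$ \Big\{ \sup_{K+r \le |u| \le K+r+1} |u|^p\, Z_{n,\vartheta}(u) > 1 \Big\} \;\subset\; \Big\{ \sup_{K+r \le |u| \le K+r+1} Z_{n,\vartheta}(u) > \varepsilon_r \Big\} . $$

At the level of the limit experiment $\mathcal{E}$, we start from (74) and obtain in the same way

$$ P_0\Big( \sup_{|u| > K} Z(u) > 1 \Big) \;\le\; b_1\, e^{-b_2 K} , \quad K \ge K_0 \tag{76} $$

together with assertion (9) of theorem 1.4. Part b) of theorem 1.4 is proved.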

4) Next we consider denominators in (jlOp or (jlip and show that there is some D > such that 

holds for all < 5 < 1 and for all re, As in [IH 81, p. 45-46], this comes from 

Q»U 2 s Z ni #(u')du' < 5 - J = Q^(j a i [Z n ^u , )-l]du' <-^\ 

< (^J 2 6 \Z n ^{u') - l\dv! > ^ < 2-~ § j 2 ^E# (\Z n ,#(u f ) - 1|) d«' 
where the last integrand is a total variation distance, thus smaller than Hellinger distance: hence (51)
for the bounded values $|u'|$ of the local parameter occurring here gives $E_\vartheta\big( |Z_{n,\vartheta}(u') - 1| \big) \le \mathrm{cst} \sqrt{|u'|}$ and proves the first of
the two assertions. The second one follows in the same way from (64).

5) Next we show on the lines of [IH 81, p. 47] that for Kq fixed and all K > Kq, r £ JNq 

(77) E* ( [ , Z : A ?„,, du) < cst-e-l b ^ 

(78) E~A [ Z{U) du) < cst-e-^ K+ ^ 

° \J{K+r<\u\<K+r+l} J R Z(u')du' J 

with respect to the same spherical sections as in step 2), and with constants which do not depend on 
n or To see this, write 

Z n Au) 



I r := / Z n ^(u)du , H r : = 

Je0„n{K+r<\u\<K+r+l} J6«.„n 



e# >n n{K+r<\u\<K+r+l} J Q# <n n{K+r<\u\<K+r+i} Je<, n Z n,ti{U )dU 



du . 



Then {J r > 2e} is a subset of { sup Z n ^(u) > e} . Applying ([73]) with 62 > as there and 

K+r<\u\<K+r+l 

with e = e T = ^e~ b2 ( K+r ^ depending on r, we obtain 



Q 1 * ( I r > e - b2{K+r) J < 63 {K+r+l) q e " ^ 2 {K+r) , r G W , K > K . 
From this, using H r < 1 and step 4), we have for any < 5 < 1 

EAH r ) < <3*U W < ^ + ( Ir > e- 6 ^^ ) + 2 1 e " * ^) 

< est - (VS + {K+r+l) q e~ h b *( K + r ) + I e ~M^+^ . 

From this, the choice $\delta = \delta_r = e^{-\frac{1}{2} b_2 (K+r)}$ yields assertion (77). The proof of (78) is similar.
6) Finally, we replace the integrals in (10) or (11)

eJ f W , Z r' : " ; . ..du) or S £ C / |«F . 1 {U \ du 



I I I C ry f t\ 1 t I P 1 I I I - — - 

Vnn{H>^} J e ^ ra Z^u')dv! J n \J{\u\>K} J m Z(u')du' 

for arbitrary p € IN and if > i^o by sums 

Z n ,-&{u) 



or 



£(2f + r + l)*ifc( f 



n{^:+r<|M|<A'+r+i} Je,, „ Z n ^(u')du' 
Y\{K + T + Vf E p [ [ 

rem ° \ J{K+r<\u\<K+r+l} J R Z(u')du' 



du 



Z(u) 

du 
and apply (77) and (78). This finishes the proof of part c) of theorem 1.4.



□ 



Bounds (|73p + ()74p above control maxima of the likelihood over spherical regions in the parameter 
space with center Together with weak convergence of likelihoods, they imply according to [IH 81] 
convergence of both maximum likelihood and Bayes estimators together with moments of arbitrary 
order. See also the approach of Strasser [S 85, Theorem 65.5 and Corollary 67.6] to convergence of 
Bayes estimators, based on a uniform integrability condition which is satisfied through (|l(jp or (|lip . 
A short exposition of the key to convergence of maximum likelihood ()12p + (jl3p and Bayes estimators 
QH+CEID is given in [K 08]. 



5.6 Proof of theorem 1.5 a): 1) From Ibragimov and Khasminskii [IH 81, lemma 2.5 on p. 
335-336], the following is known for the MLE in the limit experiment £ : the law C(u\Pq) has no point 
masses, and the argmax in (|13p is unique almost surely. Symmetry in law of two-sided Brownian 
motion around in ([5]) implies that C(u\Pq) is symmetric around 0. Due to we can choose 

K large enough to make 

Q>(\n&«r-<f)\>K) < Q» ( -P > ™P C^l < W"up > 

V ' \\u\>K \u\<K J \\u\>K J 

arbitrarily small, uniformly in n, together with 

P (\u\ >K) < Pol sup L"/° > sup L"/° J < P ( sup L u '° > 1 J . 

\\u\>K \u\<K J \\u\>K J 

This allows to approximate for fixed x 

(n (d nT -0)<x) = Q» ( sup 4t + " )A? > SU P L nT 

^ ' \ u<x u>x 



^ ' \ u<x u>x 

uniformly in n by 

( sup L^ ]l * > sup L^y» 

\ uG\-K.x] uPlx.K] 



and 
by 



P (u<x) = P ( sup L u/0 > sup Z>/° 

\ u<x u>x / 



P ( sup L"/° > sup L u /° 

' UE[—K,x] u£[x,K] 



The continuous mapping theorem and theorem 1.4 a) give weak convergence 

C { sup L$W, sup | (A £ f sup L u /°, sup Z»/o | p c 

V Me[-_R",a:] u£[i:,K] / \ ue[— K,x] u£[x,K] 
in $\mathbb{R}^2$ as $n \to \infty$, where $0$ is a continuity point for the law of the difference

£ ( sup Z>/° - sup L"/° | P ] 

again by [IH 81, lemma 2.5 on p. 335-336]. This is weak convergence of MLE.
2) Writing rescaled estimation errors at $\vartheta$ for Bayes estimators (15) as

Je L nT d C Je L nT d C Je», n L nT du 

we have 

, f n u lfJr n ^ du 
Q»(n(r nT -$)<x) = Q»\ °*- <x 



f I? + ^dv 



together with 



P (u* <x) = P \ J ~°° „ < x 

for (14) in the limit experiment. From (8) with $p = 3$ we get for all $K \ge K_0$



o 



e 1Ji „n{|? 1 |>i^} 



[ / |u|L„ T ™ ;/ ciu > — ] < Q v {L)^ n " >\u\~ 6 for some |u| > K ) < 6 X e 



independently of $n$; similarly, in the limit experiment, (9) with $p = 3$ gives

fi) f / "I L" ' " du > - 1 />, e" 62 ^ 



/ \u\L u l°du > \- ] < in- 
y{H>K} a/ 



for all $K \ge K_0$. Choosing $K$ large enough, we approximate
(80) £[ [ uL^p^du, [ I? 1 p'*du\Q t 



by 



— oo 



(81) C[f uL^du, C^du\ Q A 

\J[-K,K\ J[-K,K] J 

such that the accuracy of this approximation does not depend on $n$, and similarly

/ /'OO roo 

(82) C[ I uL u /°du, I L u /°du | P Q 

by 

(83) c[ I uL u '°du, I L u /°du | P 

' -K,K] J[-K,K] 
From 1.4 a) and the continuous mapping theorem, we have weak convergence in $\mathbb{R}^2$ as $n \to \infty$ of (81)
to (83), for arbitrary $K \ge K_0$. This yields weak convergence of (80) to (82). The second component
in (82) being strictly positive, we have weak convergence of BE. □
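The two limit objects in the proof above, the argmax of $u \to L^{u/0}$ (MLE) and the ratio $\int u L^{u/0} du / \int L^{u/0} du$ (BE), can be approximated on a grid for one simulated path. The following is a rough sketch only: the scaling factor $J_\vartheta$ is suppressed, the parameter range is truncated to $[-K, K]$ with a hypothetical `half_width` $K$, and the grid `step` and the Riemann-sum approximation of the integrals are assumptions made here for illustration.

```python
import math
import random


def simulate_limit_estimators(half_width=20.0, step=0.01, seed=0):
    """Return (mle, bayes) for one simulated path of u -> W(u) - |u|/2 on [-K, K]."""
    rng = random.Random(seed)
    m = int(round(half_width / step))
    sd = math.sqrt(step)
    # Two independent Brownian branches glued at u = 0 form two-sided Brownian motion.
    right = [0.0]
    left = [0.0]
    for _ in range(m):
        right.append(right[-1] + sd * rng.gauss(0.0, 1.0))
        left.append(left[-1] + sd * rng.gauss(0.0, 1.0))
    grid = [(-k * step, left[k]) for k in range(m, 0, -1)]
    grid += [(k * step, right[k]) for k in range(m + 1)]
    log_lik = [(u, w - 0.5 * abs(u)) for u, w in grid]
    # MLE: location of the maximum of the likelihood over the grid.
    mle = max(log_lik, key=lambda pair: pair[1])[0]
    # BE: ratio of Riemann sums; the common grid step and the shift cancel in the ratio.
    shift = max(value for _, value in log_lik)
    weights = [math.exp(value - shift) for _, value in log_lik]
    bayes = sum(u * wt for (u, _), wt in zip(log_lik, weights)) / sum(weights)
    return mle, bayes


if __name__ == "__main__":
    for seed in range(5):
        print(simulate_limit_estimators(seed=seed))
```

Repeating the call over many seeds gives empirical approximations to the laws of the two limit variables; the exponential tail bounds of theorem 1.4 are what justifies the truncation to a bounded window in the proof.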



5.7 Proof of theorem 1.5 b): For arbitrary Hq > fixed, there are constants 61,62 such that 
(84) sup (171(0^-0)1 > H) < he-~ b2H , H > H 

n 

holds: from (1791) and the trivial 



L 



H "'"' l{ "' ' V| ie„^- " }/ ^' 



ciu < if 



nT 



we get 
/ 

V 



L 



nT 



1 



u| ; ciw > 2if < — Ea 



nT 



^ nJ 



and apply ()10p . For MLE, as in the beginning of step 1) of 5.6, we have 

(85) </ ( n(d nT -$) > h) < I sup Z n{) {u) > 1 j < h e~ b2H , H > H 
V ' \H>H ' J 

by ([ED or (|75|). By weak convergence of rescaled estimation errors established in 5.6, bounds 

carry over to the limit experiment and give 



+ 



(86) 



P (\u*\>H) < b ie - b2H , P (\u\>H) < b ie - b2H , H>H Q . 



Now convergence of moments of order p £ IN for MLE and BE 



y p - 1 Q d (\n0 nT -$)\>y) dy , J y^ 1 Q* ( |n {ti* nT - 0)| > y) dy 



as Ti — y 00 to 



roo /*oo 

/ yP- 1 P (\u\>y) dy , / y^ 1 P ( \u*\ > y ) dy 
Jo Jo 

is a consequence of weak convergence of rescaled estimation errors in combination with dominated
convergence thanks to (84)+(85)+(86). □



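5.8 Proof of theorem 1.7: 1) By lemma 5.3, $\{ \widetilde P_{u+h} : h \in \mathbb{R} \}$ being statistically the same
experiment as $\widetilde{\mathcal E} = \{ \widetilde P_u : u \in \mathbb{R} \}$, laws of rescaled estimation errors

$\mathcal{L}\big( u^* - u \mid \widetilde P_u \big) \;=\; \mathcal{L}\Big( \dfrac{\int_{-\infty}^{\infty} h\, \widetilde L^{(u+h)/u}\, dh}{\int_{-\infty}^{\infty} \widetilde L^{(u+h)/u}\, dh} \;\Big|\; \widetilde P_u \Big)$

and

$\mathcal{L}\big( \widehat u - u \mid \widetilde P_u \big) \;=\; \mathcal{L}\Big( \mathop{\mathrm{argmax}}_{h \in \mathbb{R}}\, \widetilde L^{(u+h)/u} \;\Big|\; \widetilde P_u \Big)$

do not depend on $u \in \mathbb{R}$. Hence both BE (14) and MLE (13) are equivariant estimators for the
parameter in the limit experiment $\widetilde{\mathcal E}$. Thus 1.7 a) holds.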
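
The algebraic part of this equivariance can be illustrated on a simulated path. The sketch below is only an illustration under stated assumptions: it takes the limit likelihood to be of the type $e^{W_u - |u|/2}$ with two-sided Brownian motion $W$, as described in the introduction (any scaling constant is omitted), truncates the parameter to a finite grid, and uses hypothetical function names. It checks that the estimation error $u^* - u_0$ computed from likelihood ratios relative to $0$ coincides with the Bayes estimate computed from ratios re-centred at $u_0$, and likewise for the argmax.

```python
import math
import random


def simulate_log_likelihood(half_width=20.0, step=0.01, rng=None):
    """Grid u in [-half_width, half_width] and log-likelihood W_u - |u|/2.

    W is a two-sided Brownian motion, approximated by two independent
    Gaussian random walks started at 0 (discretisation is an assumption).
    """
    rng = rng or random.Random()
    m = int(round(half_width / step))
    sd = math.sqrt(step)
    right, left = [0.0], [0.0]
    for _ in range(m):
        right.append(right[-1] + rng.gauss(0.0, sd))
        left.append(left[-1] + rng.gauss(0.0, sd))
    path = left[:0:-1] + right  # W at -m*step, ..., 0, ..., m*step
    grid = [(i - m) * step for i in range(2 * m + 1)]
    loglik = [w - abs(u) / 2.0 for w, u in zip(path, grid)]
    return grid, loglik


def bayes_and_mle(grid, loglik):
    """Posterior mean under a flat prior on the grid, and the grid argmax."""
    peak = max(loglik)
    weights = [math.exp(v - peak) for v in loglik]  # subtract peak for stability
    bayes = sum(u * w for u, w in zip(grid, weights)) / sum(weights)
    mle = grid[loglik.index(peak)]
    return bayes, mle


def recentred(grid, loglik, u0_index):
    """Log-likelihood ratios relative to grid[u0_index], indexed by h = u - u0."""
    u0, base = grid[u0_index], loglik[u0_index]
    return [u - u0 for u in grid], [v - base for v in loglik]


if __name__ == "__main__":
    grid, loglik = simulate_log_likelihood(rng=random.Random(0))
    k = len(grid) // 4
    bayes, mle = bayes_and_mle(grid, loglik)
    bayes_h, mle_h = bayes_and_mle(*recentred(grid, loglik, k))
    print(bayes - grid[k], bayes_h)  # agree up to rounding
    print(mle - grid[k], mle_h)
```

The statement that the law of the re-centred ratios under $\widetilde P_u$ does not depend on $u$ is the content of lemma 5.3 and is not verified by this sketch.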

2) In order to prove 1.7 b), we shall use 'LeCam's Third Lemma' for contiguous alternatives (see [LY
90, pp. 22-23], or [H 08, 3.6+3.16]) in combination with the above equivariance property of the BE
in the limit experiment. We have seen in the proof 5.6 that pairs (80)

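$\Big( \int_{\Theta_{\vartheta,n}} u\, L_{nT}^{u/0}\, du \,,\ \int_{\Theta_{\vartheta,n}} L_{nT}^{u/0}\, du \Big)$  under $Q^\vartheta$

converge weakly in $\mathbb{R}^2$ as $n \to \infty$ to the pair

$\Big( \int_{-\infty}^{\infty} u\, \widetilde L^{u/0}\, du \,,\ \int_{-\infty}^{\infty} \widetilde L^{u/0}\, du \Big)$  under $\widetilde P_0$.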

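Obviously, for $u_0 \in \mathbb{R}$ fixed, the same argument also yields joint convergence of triplets

(87)   $\Big( L_{nT}^{u_0/0} \,,\ \int_{\Theta_{\vartheta,n}} u\, L_{nT}^{u/0}\, du \,,\ \int_{\Theta_{\vartheta,n}} L_{nT}^{u/0}\, du \Big)$  under $Q^\vartheta$

weakly in $\mathbb{R}^3$ as $n \to \infty$ to

$\Big( \widetilde L^{u_0/0} \,,\ \int_{-\infty}^{\infty} u\, \widetilde L^{u/0}\, du \,,\ \int_{-\infty}^{\infty} \widetilde L^{u/0}\, du \Big)$  under $\widetilde P_0$.
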

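For any convergent sequence $(u_n)_n$ tending to the limit $u_0$, this convergence remains valid if we place
$L_{nT}^{u_n/0}$ instead of $L_{nT}^{u_0/0}$ into the first component of (87). From joint convergence with the
sequence of likelihood ratios, LeCam's Third Lemma deduces weak convergence under the corresponding
contiguous alternatives: thus

$\Big( L_{nT}^{u_n/0} \,,\ \int_{\Theta_{\vartheta,n}} u\, L_{nT}^{u/0}\, du \,,\ \int_{\Theta_{\vartheta,n}} L_{nT}^{u/0}\, du \Big)$  under $Q^{\vartheta + u_n/n}$

converges weakly in $\mathbb{R}^3$ as $n \to \infty$ to

$\Big( \widetilde L^{u_0/0} \,,\ \int_{-\infty}^{\infty} u\, \widetilde L^{u/0}\, du \,,\ \int_{-\infty}^{\infty} \widetilde L^{u/0}\, du \Big)$  under $\widetilde P_{u_0}$.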



Using the continuous mapping theorem we obtain weak convergence of ratios

$\dfrac{\int_{\Theta_{\vartheta,n}} (u - u_n)\, L_{nT}^{u/0}\, du}{\int_{\Theta_{\vartheta,n}} L_{nT}^{u/0}\, du}$  under $Q^{\vartheta + u_n/n}$

as $n \to \infty$ to

$\dfrac{\int_{-\infty}^{\infty} (u - u_0)\, \widetilde L^{u/0}\, du}{\int_{-\infty}^{\infty} \widetilde L^{u/0}\, du}$  under $\widetilde P_{u_0}$.


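Thus we have proved weak convergence as $n \to \infty$ of rescaled BE errors

(88)   $\mathcal{L}\Big( n\big(\vartheta^*_{nT} - (\vartheta + \tfrac{u_n}{n})\big) \;\Big|\; Q^{\vartheta + u_n/n} \Big) \;\longrightarrow\; \mathcal{L}\big( u^* - u_0 \mid \widetilde P_{u_0} \big)$

under contiguous alternatives, for arbitrary convergent sequences $u_n \to u_0$. An argument of the same
structure works for rescaled MLE errors.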
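
For orientation, the general form of LeCam's Third Lemma used in this step can be written as follows. This is the usual textbook formulation, stated here as a sketch; the symbols $X_n$, $X$, $V$, $\mu$, $P_n$, $Q_n$ are generic placeholders introduced for this display and are not the paper's notation.

```latex
% Generic statement; X_n, X, V, \mu, P_n, Q_n are placeholder symbols.
% Assumptions: Q_n is contiguous to P_n, and X_n are R^k-valued statistics.
\[
  \mathcal{L}\Big( X_n \,,\ \tfrac{dQ_n}{dP_n} \;\Big|\; P_n \Big)
  \;\longrightarrow\; \mathcal{L}( X , V )
  \quad \text{weakly as } n \to \infty
\]
\[
  \Longrightarrow \qquad
  \mathcal{L}\big( X_n \mid Q_n \big) \;\longrightarrow\; \mu
  \quad \text{weakly as } n \to \infty ,
  \qquad
  \mu(B) \;:=\; E\big( 1_B(X)\, V \big) .
\]
```

In step 2) the first component of the triplets (87) plays the role of the likelihood ratio $dQ_n/dP_n$ and the two integrals play the role of $X_n$. Since the limit $V$ is then the density of the shifted limit law with respect to the law at $0$, the measure $\mu$ is the law of the limit pair under the shifted measure.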

3) Consider a loss function $\ell(\cdot)$ which is continuous, subconvex and bounded. Then (88) gives

(89)   $E_{\vartheta + u_n/n}\Big( \ell\big( n(\vartheta^*_{nT} - (\vartheta + \tfrac{u_n}{n})) \big) \Big) \;\longrightarrow\; E_{\widetilde P_{u_0}}\big( \ell( u^* - u_0 ) \big)$

for arbitrary convergent sequences $u_n \to u_0$. Via selection of convergent subsequences in compacts
$[-C, C]$, for $C$ arbitrarily large, (89) shows

(90)   $\lim_{n \to \infty}\ \sup_{|u| \le C}\ \Big|\, E_{\vartheta + u/n}\Big( \ell\big( n(\vartheta^*_{nT} - (\vartheta + \tfrac{u}{n})) \big) \Big) - E_{\widetilde P_u}\big( \ell( u^* - u ) \big) \Big| \;=\; 0$

for loss functions $\ell(\cdot)$ which are continuous, subconvex and bounded. This is part b) of theorem 1.7,
in the special case of bounded loss functions. However, this is sufficient to prove part b) of 1.7 in
general. We have exponential decrease in the bounds (84) - where the constants are independent of $\vartheta$
and $n$ - and polynomial bounds for $\ell$, thus contributions

y p - 1 Q*'(\n(KT-0 f )\>v) dy 

can be made arbitrarily small by suitable choice of $K$. At the level of the limit experiments, we use
(86) and equivariance of $u^*$ proved in step 1). Combined with (90), this finishes the proof of 1.7 b).
Again this works similarly for MLE. □



sup sup 

n>n D'eBJyyK 



We turn to the proof of theorem 1.8. 



5.9 Proof of theorem 1.8: Fix $\vartheta \in \Theta$. Throughout this proof, we consider the particular loss
function $\ell(x) = x^2$. Part b) of theorem 1.8 being immediate from 1.7 b), we have to prove part a).
The local asymptotic minimax bound in a) will be a consequence of convergence of experiments, of a
general asymptotic minimax theorem with respect to a fixed loss function $\ell(\cdot)$ given in Strasser [S 85,
Corollary 62.6] (see also LeCam [L 72], Millar [M 83, p. 91], van der Vaart [V 91, Theorem 6.1]), and
the fact that we consider Bayes estimators with respect to squared loss in the limit experiment $\widetilde{\mathcal E}$.

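1) By [S 85, Corollary 62.6] or [M 83, p. 91], we have for $C < \infty$ arbitrarily large but fixed

(91)   $\liminf_{n \to \infty}\ \inf_{\widetilde\vartheta_{nT}}\ \sup_{|u| \le C}\ E_{\vartheta + u/n}\Big( \big[ n\big( \widetilde\vartheta_{nT} - (\vartheta + \tfrac{u}{n}) \big) \big]^2 \Big)$
       $\;\ge\; \inf_{\widetilde u}\ \sup_{|u| \le C}\ E_{\widetilde P_u}\big( [\widetilde u - u]^2 \big) \;\ge\; \inf_{\widetilde u}\ \int_{-C}^{C} \frac{du}{2C}\ E_{\widetilde P_u}\big( [\widetilde u - u]^2 \big)$

where the 'inf' on the left hand side is over all estimators $\widetilde\vartheta_{nT}$ based on observation of the process $\xi$ up to
time $nT$, and the 'inf' on the right hand side over all transition probabilities from $(\Omega, \mathcal{A})$ to $(\mathbb{R}, \mathcal{B}(\mathbb{R}))$
in the limit experiment $\widetilde{\mathcal E}$ of © + (6).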



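2) For $C$ fixed, define

$u^*_C(\omega) \;:=\; \dfrac{\int_{-C}^{C} u'\, \widetilde L^{u'/0}(\omega)\, du'}{\int_{-C}^{C} \widetilde L^{u'/0}(\omega)\, du'}$ ,  $\omega \in \Omega$

taking values in $(-C, C)$. Write $\mathcal{R}_C(du')$ for the uniform law on $(-C, C)$. Consider the probability

$\mathbb{P}_C(du', d\omega) \;:=\; \mathcal{R}_C(du')\, \widetilde P_{u'}(d\omega) \;=\; \mathcal{R}_C(du')\, \widetilde P_0(d\omega)\, \widetilde L^{u'/0}(\omega)$

on $\big( (-C,C) \times \Omega \,,\ \mathcal{B}(-C,C) \otimes \mathcal{A} \big)$. By the Bayes property, or the $L^2\big( (-C,C) \times \Omega, \mathbb{P}_C \big)$-projection
property of conditional expectations, we can continue the inequality of step 1) in the form

(92)   $\inf_{\widetilde u}\ E_{\mathbb{P}_C}\big( [\widetilde u - u]^2 \big) \;\ge\; E_{\mathbb{P}_C}\big( [u^*_C - u]^2 \big) \;\ge\; E_{\mathbb{P}_C}\big( [Y_C]^2 \big)$

where we define (all probability laws $\widetilde P_{u'}$ in the limit experiment $\widetilde{\mathcal E}$ being equivalent)

$Y_C(u, \omega) \;:=\; \displaystyle\int 1_{(-C-u,\, C-u)}(h)\ h\ \dfrac{\widetilde L^{(u+h)/u}(\omega)}{\int_{\mathbb{R}} \widetilde L^{(u+h')/u}(\omega)\, dh'}\ dh$

and use the trivial inequality $[u^*_C - u]^2 \ge [Y_C]^2$ on $(-C,C) \times \Omega$.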
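
The first inequality in (92) is the usual orthogonal decomposition of squared error around the posterior mean. As a sketch for a non-randomized, square-integrable estimator (written $\widetilde u = \widetilde u(\omega)$ here as a placeholder symbol), with $u^*_C$ read as the conditional expectation of $u$ given $\omega$ under the joint law $\mathbb{P}_C$:

```latex
% \widetilde u = \widetilde u(\omega): placeholder for an arbitrary estimator.
% u^*_C(\omega) = E_{\mathbb{P}_C}( u \mid \omega ): posterior mean under the uniform prior.
\begin{align*}
E_{\mathbb{P}_C}\big( [\widetilde u - u]^2 \big)
 &= E_{\mathbb{P}_C}\big( [\widetilde u - u^*_C]^2 \big)
  + 2\, E_{\mathbb{P}_C}\big( (\widetilde u - u^*_C)(u^*_C - u) \big)
  + E_{\mathbb{P}_C}\big( [u^*_C - u]^2 \big) \\
 &= E_{\mathbb{P}_C}\big( [\widetilde u - u^*_C]^2 \big)
  + E_{\mathbb{P}_C}\big( [u^*_C - u]^2 \big)
 \;\ge\; E_{\mathbb{P}_C}\big( [u^*_C - u]^2 \big) .
\end{align*}
% The cross term vanishes after conditioning on \omega, because
% \widetilde u - u^*_C depends on \omega only and
% E_{\mathbb{P}_C}( u^*_C - u \mid \omega ) = 0.
```

The second inequality in (92) only enlarges the normalizing integral from $(-C-u, C-u)$ to the whole real line, which shrinks the absolute value of the ratio.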

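3) Given inequalities (91) + (92), the proof of part a) of theorem 1.8 will be finished if we prove

(93)   $\lim_{C \to \infty} E_{\mathbb{P}_C}\big( [Y_C]^2 \big) \;=\; E_{\widetilde P_0}\big( [u^*]^2 \big)$

for the Bayes estimator $u^*$ with 'uniform prior on the real line'

$u^* - u \;=\; \displaystyle\int_{\mathbb{R}} (u' - u)\, \dfrac{\widetilde L^{u'/0}(\omega)}{\int_{\mathbb{R}} \widetilde L^{u''/0}(\omega)\, du''}\, du' \;=\; \int_{\mathbb{R}} h\, \dfrac{\widetilde L^{(u+h)/u}(\omega)}{\int_{\mathbb{R}} \widetilde L^{(u+h')/u}(\omega)\, dh'}\, dh$

in the limit experiment $\widetilde{\mathcal E}$, cf. (14) and 1.7 a).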
4) It remains to prove (93). Introducing

(94)   $\Phi_{C,u}(h) \;:=\; 1_{(-C-u,\, C-u)}(h) - 1_{(-C,\, C)}(h)$ ,  $u \in (-C, C)$

and

,W ._ / ^„ /M h Ah S 1 ) ' 



we have a representation

(95)   $Y_C(u, \omega) \;=\; (u^* - u) + \rho^{(1)}_{C,u}(\omega) + \rho^{(2)}_{C,u}(\omega)$  on $(-C,C) \times \Omega$

where the law under $\widetilde P_u$ of the third term on the right hand side of (95) does not depend on $u \in \mathbb{R}$
(cf. lemma 5.3 and the first part of the proof 5.8). Since $u^*$ is equivariant in $\widetilde{\mathcal E}$ and has finite variance,
it is sufficient to show

(96)   $\lim_{C \to \infty} E_{\mathbb{P}_C}\big( [Y_C - (u^* - u)]^2 \big) \;=\; 0$

to establish (93). For $\varepsilon > 0$ arbitrarily small, choose first $K = K(\varepsilon)$ large enough for

J («+&)/« 
/iR^"""" 1 " f R LW*dh 

using theorem 1.4 c), and then $C = C(\varepsilon)$ large enough for $C > K$ together with



E P / H\h\>K} h 2 — — ... d/i I < e 



Ea \ I h 2 ^ dh < e 




In both last inequalities, the left hand side does not depend on $u$. For $\rho^{(2)}_{C,u}$, Jensen inequality gives



rC 
-C 

since $C > K$. Considering $\rho^{(1)}_{C,u}$, we have the following bounds for $u \in (-C, C)$: first, $|\Phi_{C,u}| \le 1_{\{|\cdot| > K\}}$
as long as $|u| \le C - K$, second, $|\Phi_{C,u}| \le 1$ in the remaining cases $C - K < |u| < C$. Again with Jensen,



is thus by our choice of C and K smaller than 



Combining the last two bounds with (94), this gives

$E_{\mathbb{P}_C}\big( [Y_C - (u^* - u)]^2 \big) \;\le\; 2 \displaystyle\int_{-C}^{C} \frac{du}{2C}\ E_{\widetilde P_u}\Big( [\rho^{(1)}_{C,u}]^2 + [\rho^{(2)}_{C,u}]^2 \Big) \;<\; 6\varepsilon .$

Since $\varepsilon > 0$ was arbitrary, this proves (96) and thus (93). By (91) + (92) + (93), the proof of theorem
1.8 is finished. □